The UniD³-generated datasets exceeded an F1 score of 0.80 on all three tasks, and expert validation reached an F1 of 0.9005 on the DDM task.
The system draws on more than 150,000 drug-related publications from PubMed, which are processed with Llama3.3-70B using carefully designed prompts.
Yes, all datasets (DDM, DEA, DTA) are available on HuggingFace and can be accessed using pandas or the HuggingFace datasets library.
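
As a rough illustration of the two access routes, the sketch below shows how loading might look. The repository ID `your-org/UniD3`, the per-task file layout, and the use of task names as dataset configurations are all assumptions made for the example and are not stated in the source; check the dataset page on HuggingFace for the actual values. The helper names are hypothetical as well.

```python
# Hypothetical repository ID: the source does not give the real HuggingFace path.
REPO_ID = "your-org/UniD3"  # placeholder, replace with the actual dataset repository
TASKS = ("DDM", "DEA", "DTA")  # task names as given in the text


def hf_parquet_uri(task: str, repo_id: str = REPO_ID, split: str = "train") -> str:
    """Build an hf:// URI for one task's Parquet file (the file layout is assumed)."""
    if task not in TASKS:
        raise ValueError(f"unknown task {task!r}; expected one of {TASKS}")
    return f"hf://datasets/{repo_id}/{task}/{split}.parquet"


def load_with_pandas(task: str, repo_id: str = REPO_ID):
    """Read one task into a DataFrame.

    Requires pandas, pyarrow and huggingface_hub (which provides hf:// support).
    """
    import pandas as pd  # imported lazily so the URI helper works without pandas

    return pd.read_parquet(hf_parquet_uri(task, repo_id))


def load_with_datasets(task: str, repo_id: str = REPO_ID):
    """Load one task with the datasets library, assuming each task is a named config."""
    from datasets import load_dataset  # lazy import, same reason as above

    return load_dataset(repo_id, task)
```

With the real repository ID filled in, `load_with_pandas("DDM")` would return a DataFrame and `load_with_datasets("DDM")` a `DatasetDict`, provided the repository is organized the way this sketch assumes.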