UniTox Supplementary Materials

Neural Information Processing Systems 

For what purpose was the dataset created?

UniTox was created as a unified toxicity dataset across eight types of drug toxicities [...] We generated information across all toxicities for the same set of 2,418 drugs with the same methodology of applying LLMs. For each drug, for each toxicity, we provide an LLM-generated summary of the relevant portions of the drug label, as well as ternary (No/Less/Most) predictions and binary (No/Yes) predictions for [...]

Who created the dataset (e.g., which team, [...]

Drugs that do not have a current FDA-approved label (e.g., withdrawn or discontinued drugs) are not [...]

Each instance is a single drug. For each instance, there are eight toxicities, and for each toxicity, there is an LLM-generated summary of the relevant sections of the drug label, a ternary prediction (No/Less/Most), and a binary prediction (No/Yes). Each instance also provides the unique SPL ID, allowing users to find the exact text used to generate the instance data.

Is there a label or target associated with each instance?
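As a rough illustration of the instance layout described above (one drug, eight toxicities, and for each toxicity a summary, a ternary prediction, and a binary prediction, plus an SPL ID), the following Python sketch models a single record. The class names, field names, and placeholder toxicity keys are hypothetical and chosen only for illustration; they are not the column names of the released dataset, and the sketch does not assume any particular relationship between the ternary and binary predictions.

```python
from dataclasses import dataclass
from typing import Dict

# Label vocabularies as stated in the datasheet text.
TERNARY_LABELS = ("No", "Less", "Most")
BINARY_LABELS = ("No", "Yes")
N_TOXICITIES = 8  # each instance covers eight toxicity types


@dataclass(frozen=True)
class ToxicityAnnotation:
    """Per-toxicity fields for one drug (field names are hypothetical)."""
    summary: str   # LLM-generated summary of the relevant label sections
    ternary: str   # one of No / Less / Most
    binary: str    # one of No / Yes

    def __post_init__(self) -> None:
        # Reject values outside the two label vocabularies.
        if self.ternary not in TERNARY_LABELS:
            raise ValueError(f"ternary must be one of {TERNARY_LABELS}")
        if self.binary not in BINARY_LABELS:
            raise ValueError(f"binary must be one of {BINARY_LABELS}")


@dataclass(frozen=True)
class DrugInstance:
    """One dataset instance: a single drug (field names are hypothetical)."""
    drug_name: str
    spl_id: str  # unique SPL ID pointing back to the source label text
    toxicities: Dict[str, ToxicityAnnotation]

    def __post_init__(self) -> None:
        # Every instance is expected to carry all eight toxicity types.
        if len(self.toxicities) != N_TOXICITIES:
            raise ValueError(f"expected {N_TOXICITIES} toxicities")


# Usage with placeholder values only (not real dataset content).
placeholder = ToxicityAnnotation(summary="placeholder summary",
                                 ternary="Less", binary="Yes")
example = DrugInstance(
    drug_name="example-drug",
    spl_id="example-spl-id",
    toxicities={f"toxicity_{i}": placeholder for i in range(1, 9)},
)
```

Validating the label vocabularies at construction time keeps malformed rows from propagating silently into downstream analysis; a reader loading the actual files would substitute the real column and toxicity names.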