From Knowledge Generation to Knowledge Verification: Examining the BioMedical Generative Capabilities of ChatGPT
Hamed, Ahmed Abdeen, Lee, Byung Suk
–arXiv.org Artificial Intelligence
The generative capabilities of LLM models present opportunities in accelerating tasks and concerns with the authenticity of the knowledge it produces. To address the concerns, we present a computational approach that systematically evaluates the factual accuracy of biomedical knowledge that an LLM model has been prompted to generate. Our approach encompasses two processes: the generation of disease-centric associations and the verification of them using the semantic knowledge of the biomedical ontologies. Using ChatGPT as the select LLM model, we designed a set of prompt-engineering processes to generate linkages between diseases, drugs, symptoms, and genes to establish grounds for assessments. Experimental results demonstrate high accuracy in identifying disease terms (88%-97%), drug names (90%-91%), and genetic information (88%-98%). The symptom term identification accuracy was notably lower (49%-61%), as verified against the DOID, ChEBI, SYMPTOM, and GO ontologies accordingly. The verification of associations reveals literature coverage rates of (89%-91%) among disease-drug and disease-gene associations. The low identification accuracy for symptom terms also contributed to the verification of symptom-related associations (49%-62%).
arXiv.org Artificial Intelligence
Feb-20-2025
- Country:
- Europe
- Poland > Lesser Poland Province
- Kraków (0.04)
- Spain > Andalusia
- Málaga Province > Málaga (0.04)
- United Kingdom > England
- Kent > Canterbury (0.04)
- Poland > Lesser Poland Province
- North America > United States
- District of Columbia > Washington (0.04)
- New York > Broome County
- Binghamton (0.04)
- Vermont > Chittenden County
- Burlington (0.04)
- South America > Colombia (0.04)
- Europe
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: