Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs

Karim, Ahmed Akib Jawad, Mahmud, Muhammad Zawad, Islam, Samiha, Azam, Aznur

Nov-19-2024–arXiv.org Artificial Intelligence

In this research, we explored how pre-trained language models can improve multi-class disease classification on the Medical-Abstracts-TC-Corpus, which spans five medical conditions. We excluded non-cancer conditions and examined four specific diseases. We assessed four LLMs: BioBERT, XLNet, BERT, and a novel base model, LastBERT. BioBERT, which was pre-trained on medical data, demonstrated superior performance in medical text classification (97% accuracy). Surprisingly, XLNet followed closely (96% accuracy), demonstrating its generalizability across domains even though it was not pre-trained on medical data. LastBERT, a custom model based on a lighter version of BERT, also proved competitive with 87.10% accuracy (just under BERT's 89.33%). Our findings confirm the importance of specialized models such as BioBERT and also support the case for more general solutions such as XLNet and for well-tuned transformer architectures with fewer parameters (in this case, LastBERT) in medical domain tasks.
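
The abstract compares models by accuracy on a four-class task. As a minimal sketch of how such a figure is typically computed, the Python below scores a list of predictions against true labels and also reports per-class recall. The class names follow the paper title, while the label spelling, the helper names, and the toy labels are assumptions made for illustration; they are not the authors' evaluation code or results.

```python
from collections import Counter

# Class names follow the paper title; spelling and order are assumptions.
CLASSES = ["neoplasms", "cardiovascular", "nervous_system", "digestive"]


def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted class equals the true class."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def per_class_recall(y_true, y_pred):
    """Recall per class: correct predictions / true examples of that class."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    # Classes with no true examples are skipped to avoid dividing by zero.
    return {c: hits[c] / totals[c] for c in CLASSES if totals[c]}


# Toy labels invented for illustration only (not results from the paper).
y_true = ["neoplasms", "cardiovascular", "nervous_system", "digestive", "neoplasms"]
y_pred = ["neoplasms", "cardiovascular", "digestive", "digestive", "neoplasms"]

toy_accuracy = accuracy(y_true, y_pred)
toy_recall = per_class_recall(y_true, y_pred)
```

Reporting per-class recall alongside overall accuracy is a common design choice when classes are imbalanced, since a single accuracy number can hide a class that the model rarely gets right.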

classification, large language model, machine learning



Country:
- North America > United States
  - California > Orange County > Irvine (0.04)
- Asia > Bangladesh
  - Dhaka Division > Dhaka District > Dhaka (0.05)

Genre:
- Research Report > New Finding (0.69)

Industry:
- Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning
    - Performance Analysis > Accuracy (1.00)
    - Neural Networks > Deep Learning (1.00)
