Goto

Collaborating Authors

 treatment regimen


Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology

Wang, Peilong, Liu, Zhengliang, Li, Yiwei, Holmes, Jason, Shu, Peng, Zhang, Lian, Li, Xiang, Li, Quanzheng, Laughlin, Brady S., Toesca, Diego Santos, Vora, Sujay A., Patel, Samir H., Sio, Terence T., Liu, Tianming, Liu, Wei

arXiv.org Artificial Intelligence

Background: The radiation oncology clinical practice involves many steps relying on the dynamic interplay of abundant text data. Large language models have displayed remarkable capabilities in processing complex text information. But their direct applications in specific fields like radiation oncology remain underexplored. Purpose: This study aims to investigate whether fine-tuning LLMs with domain knowledge can improve the performance on Task (1) treatment regimen generation, Task (2) treatment modality selection (photon, proton, electron, or brachytherapy), and Task (3) ICD-10 code prediction in radiation oncology. Methods: Data for 15,724 patient cases were extracted. Cases where patients had a single diagnostic record, and a clearly identifiable primary treatment plan were selected for preprocessing and manual annotation to have 7,903 cases of the patient diagnosis, treatment plan, treatment modality, and ICD-10 code. Each case was used to construct a pair consisting of patient diagnostics details and an answer (treatment regimen, treatment modality, or ICD-10 code respectively) for the supervised fine-tuning of these three tasks. Open source LLaMA2-7B and Mistral-7B models were utilized for the fine-tuning with the Low-Rank Approximations method. Accuracy and ROUGE-1 score were reported for the fine-tuned models and original models. Clinical evaluation was performed on Task (1) by radiation oncologists, while precision, recall, and F-1 score were evaluated for Task (2) and (3). One-sided Wilcoxon signed-rank tests were used to statistically analyze the results. Results: Fine-tuned LLMs outperformed original LLMs across all tasks with p-value <= 0.001. Clinical evaluation demonstrated that over 60% of the fine-tuned LLMs-generated treatment regimens were clinically acceptable. Precision, recall, and F1-score showed improved performance of fine-tuned LLMs.


CT-ADE: An Evaluation Benchmark for Adverse Drug Event Prediction from Clinical Trial Results

Yazdani, Anthony, Bornet, Alban, Zhang, Boya, Khlebnikov, Philipp, Amini, Poorya, Teodoro, Douglas

arXiv.org Artificial Intelligence

Adverse drug events (ADEs) significantly impact clinical research and public health, contributing to failures in clinical trials and leading to increased healthcare costs. The accurate prediction and management of ADEs are crucial for improving the development of safer, more effective medications, and enhancing patient outcomes. To support this effort, we introduce CT-ADE, a novel dataset compiled to enhance the predictive modeling of ADEs. Encompassing over 12,000 instances extracted from clinical trial results, the CT-ADE dataset integrates drug, patient population, and contextual information for multilabel ADE classification tasks in monopharmacy treatments, providing a comprehensive resource for developing advanced predictive models. To mirror the complex nature of ADEs, annotations are standardized at the system organ class level of the Medical Dictionary for Regulatory Activities (MedDRA) ontology. Preliminary analyses using baseline models have demonstrated promising results, achieving 73.33% F1 score and 81.54% balanced accuracy, highlighting CT-ADE's potential to advance ADE prediction. CT-ADE provides an essential tool for researchers aiming to leverage the power of artificial intelligence and machine learning to enhance patient safety and minimize the impact of ADEs on pharmaceutical research and development. Researchers interested in using the CT-ADE dataset can find all necessary resources at https://github.com/xxxx/xxxx.


RadOnc-GPT: A Large Language Model for Radiation Oncology

Liu, Zhengliang, Wang, Peilong, Li, Yiwei, Holmes, Jason, Shu, Peng, Zhang, Lian, Liu, Chenbin, Liu, Ninghao, Zhu, Dajiang, Li, Xiang, Li, Quanzheng, Patel, Samir H., Sio, Terence T., Liu, Tianming, Liu, Wei

arXiv.org Artificial Intelligence

This paper presents RadOnc-GPT, a large language model specialized for radiation oncology through advanced tuning methods. RadOnc-GPT was finetuned on a large dataset of radiation oncology patient records from the Mayo Clinic in Arizona. The model employs instruction tuning on three key tasks - generating radiotherapy treatment regimens, determining optimal radiation modalities, and providing diagnostic descriptions/ICD codes based on patient diagnostic details. Evaluations conducted by comparing RadOnc-GPT outputs to general large language model outputs showed higher ROUGE scores in these three tasks. The study demonstrated the potential of using large language models fine-tuned using domain-specific knowledge like RadOnc-GPT to achieve transformational capabilities in highly specialized healthcare fields such as radiation oncology. However, our model's clinical relevance requires confirmation, and it specializes in only the aforementioned three specific tasks and lacks broader applicability. Furthermore, its evaluation through ROUGE scores might not reflect the true semantic and clinical accuracy - challenges we intend to address in future research.


How doctors are using machine learning to improve health outcomes

#artificialintelligence

An ounce of prevention is worth a pound of cure, as the old saying goes. Until recently, that simply meant living a healthy lifestyle, getting regular checkups, and hoping that signs of anything serious were caught early. But today, doctors are using artificial intelligence (AI) and machine learning systems to make preventative care, diagnosis, and treatment more accurate and effective than ever. "Machine learning involves adaptive learning and as such, can identify patterns over time as new data is aggregated and analyzed," explains Melissa Manice, co-founder of healthcare startup Cohero Health. "Therefore, machine learning and AI allows doctors to detect abnormal behaviors and predictive insights with the application of clinical thresholds to machine learning algorithms," she continues.


Artificial intelligence helps to treat tuberculosis more effectively - Medical News Bulletin Health News and Medical Research

#artificialintelligence

The spread of tuberculosis (TB) has diminished in the developed world, but it is still prevalent in the developing parts of the world such as in Asia and Africa. The rise of HIV in the 1980's also saw an increase in TB infections due to the weakened immune systems of patients with HIV. Currently about 1.6 million people die from TB each year, and 10 million people develop active TB infections, which is also contagious. Tuberculosis is caused by Mycobacterium tuberculosis bacteria and it generally affects the lungs. Individuals can harbor the TB bacteria but show no symptoms.


Learning Treatment Regimens from Electronic Medical Records

Hoang, Khanh-Hung, Ho, Tu-Bao

arXiv.org Artificial Intelligence

Appropriate treatment regimens play a vital role in improving patient health status. Although some achievements have been made, few of the recent studies of learning treatment regimens have exploited different kinds of patient information due to the difficulty in adopting heterogeneous data to many data mining methods. Moreover, current studies seem too rigid with fixed intervals of treatment periods corresponding to the varying lengths of hospital stay. To this end, this work proposes a generic data-driven framework which can derive group-treatment regimens from electronic medical records by utilizing a mixed-variate restricted Boltzmann machine and incorporating medical domain knowledge. We conducted experiments on coronary artery disease as a case study. The obtained results show that the framework is promising and capable of assisting physicians in making clinical decisions.


Ranking and Selection with Covariates for Personalized Decision Making

Shen, Haihui, Hong, L. Jeff, Zhang, Xiaowei

arXiv.org Machine Learning

We consider a ranking and selection problem in the context of personalized decision making, where the best alternative is not universal but varies as a function of observable covariates. The goal of ranking and selection with covariates (R&S-C) is to use sampling to compute a decision rule that can specify the best alternative with certain statistical guarantee for each subsequent individual after observing his or her covariates. A linear model is proposed to capture the relationship between the mean performance of an alternative and the covariates. Under the indifference-zone formulation, we develop two-stage procedures for both homoscedastic and heteroscedastic sampling errors, respectively, and prove their statistical validity, which is defined in terms of probability of correct selection. We also generalize the well-known slippage configuration, and prove that the generalized slippage configuration is the least favorable configuration of our procedures. Extensive numerical experiments are conducted to investigate the performance of the proposed procedures. Finally, we demonstrate the usefulness of R&S-C via a case study of selecting the best treatment regimen in the prevention of esophageal cancer. We find that by leveraging disease-related personal information, R&S-C can improve substantially the expected quality-adjusted life years for some groups of patients through providing patient-specific treatment regimen.


Knowledge-Based Avoidance of Drug-Resistant HIV Mutants

Lathrop, Richard H., Steffen, Nicholas R., Raphael, Miriam P., Deeds-Rubin, Sophia, Cimoch, Paul J., See, Darryl M., Tilles, Jeremiah G.

AI Magazine

We describe an AI system (CTSHIV) that connects the scientific AIDS literature describing specific human immunodeficiency virus (HIV) drug resistances directly to the customized treatment strategy of a specific HIV patient. Rules in the CTSHIV knowledge base encode knowledge about sequence mutations in the HIV genome that have been found to result in drug resistance to the HIV virus. Rules are applied to the actual HIV sequences of the virus strains infecting the specific patient undergoing clinical treatment to infer current drug resistance. A rule-directed search through mutation sequence space identifies nearby drug-resistant mutant strains that might arise. The possible combination drug-treatment regimens currently approved by the U.S. Food and Drug Administration are considered and ranked by their estimated ability to avoid identified current and nearby drug-resistant mutants. The highest-ranked treatments are recommended to the attending physician. The result is more precise treatment of individual HIV patients and a decreased tendency to select for drug-resistant genes in the global HIV gene pool. Initial results from a small human clinical trial are encouraging, and further clinical trials are planned. From an AI viewpoint, the case study demonstrates the extensibility of knowledge-based systems because it illustrates how existing encoded knowledge can be used to support new knowledge-based applications that were unanticipated when the original knowledge was encoded.