AITopics | Devarakonda, Murthy

Collaborating Authors

Devarakonda, Murthy

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

sc-OTGM: Single-Cell Perturbation Modeling by Solving Optimal Mass Transport on the Manifold of Gaussian Mixtures

Demir, Andac, Solovyeva, Elizaveta, Boylan, James, Xiao, Mei, Serluca, Fabrizio, Hoersch, Sebastian, Jenkins, Jeremy, Devarakonda, Murthy, Kiziltan, Bulent

arXiv.org Artificial IntelligenceMay-6-2024

Influenced by breakthroughs in LLMs, single-cell foundation models are emerging. While these models show successful performance in cell type clustering, phenotype classification, and gene perturbation response prediction, it remains to be seen if a simpler model could achieve comparable or better results, especially with limited data. This is important, as the quantity and quality of single-cell data typically fall short of the standards in textual data used for training LLMs. Single-cell sequencing often suffers from technical artifacts, dropout events, and batch effects. These challenges are compounded in a weakly supervised setting, where the labels of cell states can be noisy, further complicating the analysis. To tackle these challenges, we present sc-OTGM, streamlined with less than 500K parameters, making it approximately 100x more compact than the foundation models, offering an efficient alternative. sc-OTGM is an unsupervised model grounded in the inductive bias that the scRNAseq data can be generated from a combination of the finite multivariate Gaussian distributions. The core function of sc-OTGM is to create a probabilistic latent space utilizing a GMM as its prior distribution and distinguish between distinct cell populations by learning their respective marginal PDFs. It uses a Hit-and-Run Markov chain sampler to determine the OT plan across these PDFs within the GMM framework. We evaluated our model against a CRISPR-mediated perturbation dataset, called CROP-seq, consisting of 57 one-gene perturbations. Our results demonstrate that sc-OTGM is effective in cell state classification, aids in the analysis of differential gene expression, and ranks genes for target identification through a recommender system. It also predicts the effects of single-gene perturbations on downstream gene regulation and generates synthetic scRNA-seq data conditioned on specific cell states.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2405.03726

Country: North America > United States (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

Add feedback

Customizing Knowledge Graph Embedding to Improve Clinical Study Recommendation

Liu, Xiong, Khalil, Iya, Devarakonda, Murthy

arXiv.org Artificial IntelligenceDec-28-2022

Inferring knowledge from clinical trials using knowledge graph embedding is an emerging area. However, customizing graph embeddings for different use cases remains a significant challenge. We propose custom2vec, an algorithmic framework to customize graph embeddings by incorporating user preferences in training the embeddings. It captures user preferences by adding custom nodes and links derived from manually vetted results of a separate information retrieval method. We propose a joint learning objective to preserve the original network structure while incorporating the user's custom annotations. We hypothesize that the custom training improves user-expected predictions, for example, in link prediction tasks. We demonstrate the effectiveness of custom2vec for clinical trials related to non-small cell lung cancer (NSCLC) with two customization scenarios: recommending immuno-oncology trials evaluating PD-1 inhibitors and exploring similar trials that compare new therapies with a standard of care. The results show that custom2vec training achieves better performance than the conventional training methods. Our approach is a novel way to customize knowledge graph embeddings and enable more accurate recommendations and predictions.

artificial intelligence, information retrieval, natural language, (2 more...)

arXiv.org Artificial Intelligence

2212.14102

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.89)

Industry: Health & Medicine > Therapeutic Area > Oncology > Lung Cancer (0.87)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.80)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.53)

Add feedback

A Scalable AI Approach for Clinical Trial Cohort Optimization

Liu, Xiong, Shi, Cheng, Deore, Uday, Wang, Yingbo, Tran, Myah, Khalil, Iya, Devarakonda, Murthy

arXiv.org Artificial IntelligenceSep-6-2021

FDA has been promoting enrollment practices that could enhance the diversity of clinical trial populations, through broadening eligibility criteria. However, how to broaden eligibility remains a significant challenge. We propose an AI approach to Cohort Optimization (AICO) through transformer-based natural language processing of the eligibility criteria and evaluation of the criteria using real-world data. The method can extract common eligibility criteria variables from a large set of relevant trials and measure the generalizability of trial designs to real-world patients. It overcomes the scalability limits of existing manual methods and enables rapid simulation of eligibility criteria design for a disease of interest. A case study on breast cancer trial design demonstrates the utility of the method in improving trial generalizability.

deep learning, eligibility criteria, neural network, (23 more...)

arXiv.org Artificial Intelligence

2109.02808

Country: North America > United States (0.67)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Government > Regional Government > North America Government > United States Government > FDA (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Toward Generating Domain-Specific / Personalized Problem Lists from Electronic Medical Records

Tsou, Ching-Huei (IBM) | Devarakonda, Murthy (IBM) | Liang, Jennifer J. (IBM)

AAAI ConferencesNov-1-2015

An accurate problem list plays the key role of a problem-oriented medical record, which plays a significant role in improving patient care. However, the multi-author, multi-purpose nature of problem list makes it a challenge to maintain, and a single list is difficult, if not impossible, to satisfy all the needs of different practitioners. In this paper, we propose using machine generated problem list to assist a medical practitioner to review a patient’s chart. The proposed system scans both structured and unstructured data in a patient’s electronic medical record (EMR) and generates a ranked, recall-oriented problem list grouped by body systems. Details of each problem are readily available for the user to assess the correctness and relevance of the problem. The user can then provide feedback to the system on the trustworthiness of each evidence passage retrieved, as well as the validity of the problem as a whole. The user-specific feedback provides new information the system needs to perform active learning to learn the user’s preference and produce personalized, and/or domain-specific problem lists.

health & medicine, medical record, problem list, (17 more...)

AAAI Conferences

2015 AAAI Fall Symposium Series

Industry: Health & Medicine > Health Care Technology > Medical Record (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Automated Problem List Generation from Electronic Medical Records in IBM Watson

Devarakonda, Murthy (IBM Research and Watson Group) | Tsou, Ching-Huei (IBM Research and Watson Group)

AAAI ConferencesMar-15-2015

Identifying a patient’s important medical problems requires broad and deep medical expertise, as well as significant time to gather all the relevant facts from the patient’s medical record and assess the clinical importance of the facts in reaching the final conclusion. A patient’s medical problem list is by far the most critical information that a physician uses in treatment and care of a patient. In spite of its critical role, its curation, manual or automated, has been an unmet need in clinical practice. We developed a machine learning technique in IBM Watson to automatically generate a patient’s medical problem list. The machine learning model uses lexical and medical features extracted from a patient’s record using NLP techniques. We show that the automated method achieves 70% recall and 67% precision based on the gold standard that medical experts created on a set of de-identified patient records from a major hospital system in the US. To the best of our knowledge this is the first successful machine learning/NLP method of extracting an open-ended patient’s medical problems from an Electronic Medical Record (EMR). This paper also contributes a methodology for assessing accuracy of a medical problem list generation technique.

automated problem list generation, electronic medical record, ibm watson

AAAI Conferences

Twenty-Seventh IAAI Conference

Country: North America > United States (0.24)

Industry: Health & Medicine > Health Care Technology > Medical Record (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Memory-Based Learning > Case Based Reasoning (0.60)

Add feedback