AITopics | Callahan, Tiffany J.

Collaborating Authors

Callahan, Tiffany J.

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Agentic Mixture-of-Workflows for Multi-Modal Chemical Search

Callahan, Tiffany J., Park, Nathaniel H., Capponi, Sara

arXiv.org Artificial IntelligenceFeb-26-2025

The vast and complex materials design space demands innovative strategies to integrate multidisciplinary scientific knowledge and optimize materials discovery. While large language models (LLMs) have demonstrated promising reasoning and automation capabilities across various domains, their application in materials science remains limited due to a lack of benchmarking standards and practical implementation frameworks. To address these challenges, we introduce Mixture-of-Workflows for Self-Corrective Retrieval-Augmented Generation (CRAG-MoW) - a novel paradigm that orchestrates multiple agentic workflows employing distinct CRAG strategies using open-source LLMs. Unlike prior approaches, CRAG-MoW synthesizes diverse outputs through an orchestration agent, enabling direct evaluation of multiple LLMs across the same problem domain. We benchmark CRAG-MoWs across small molecules, polymers, and chemical reactions, as well as multi-modal nuclear magnetic resonance (NMR) spectral retrieval. Our results demonstrate that CRAG-MoWs achieve performance comparable to GPT-4o while being preferred more frequently in comparative evaluations, highlighting the advantage of structured retrieval and multi-agent synthesis. By revealing performance variations across data types, CRAG-MoW provides a scalable, interpretable, and benchmark-driven approach to optimizing AI architectures for materials discovery. These insights are pivotal in addressing fundamental gaps in benchmarking LLMs and autonomous AI agents for scientific applications.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2502.19629

Country: North America > United States (0.27)

Genre:

Workflow (1.00)
Research Report > New Finding (1.00)

Industry:

Materials > Chemicals > Commodity Chemicals > Petrochemicals (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Energy > Oil & Gas (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

RNA-KG: An ontology-based knowledge graph for representing interactions involving RNA molecules

Cavalleri, Emanuele, Cabri, Alberto, Soto-Gomez, Mauricio, Bonfitto, Sara, Perlasca, Paolo, Gliozzo, Jessica, Callahan, Tiffany J., Reese, Justin, Robinson, Peter N, Casiraghi, Elena, Valentini, Giorgio, Mesiti, Marco

arXiv.org Artificial IntelligenceNov-30-2023

The "RNA world" represents a novel frontier for the study of fundamental biological processes and human diseases and is paving the way for the development of new drugs tailored to the patient's biomolecular characteristics. Although scientific data about coding and non-coding RNA molecules are continuously produced and available from public repositories, they are scattered across different databases and a centralized, uniform, and semantically consistent representation of the "RNA world" is still lacking. We propose RNA-KG, a knowledge graph encompassing biological knowledge about RNAs gathered from more than 50 public databases, integrating functional relationships with genes, proteins, and chemicals and ontologically grounded biomedical concepts. To develop RNA-KG, we first identified, pre-processed, and characterized each data source; next, we built a meta-graph that provides an ontological description of the KG by representing all the bio-molecular entities and medical concepts of interest in this domain, as well as the types of interactions connecting them. Finally, we leveraged an instance-based semantically abstracted knowledge model to specify the ontological alignment according to which RNA-KG was generated. RNA-KG can be downloaded in different formats and also queried by a SPARQL endpoint. A thorough topological analysis of the resulting heterogeneous graph provides further insights into the characteristics of the "RNA world". RNA-KG can be both directly explored and visualized, and/or analyzed by applying computational methods to infer bio-medical knowledge from its heterogeneous nodes and edges. The resource can be easily updated with new experimental data, and specific views of the overall KG can be extracted according to the bio-medical problem to be studied.

artificial intelligence, nucleic acid res, rna-kg, (14 more...)

arXiv.org Artificial Intelligence

2312.00183

Country:

Europe (0.67)
North America > United States > New York (0.14)

Genre: Research Report > Experimental Study (0.45)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Hematology (0.67)
Health & Medicine > Therapeutic Area > Neurology (0.67)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)

Add feedback

An Open-Source Knowledge Graph Ecosystem for the Life Sciences

Callahan, Tiffany J., Tripodi, Ignacio J., Stefanski, Adrianne L., Cappelletti, Luca, Taneja, Sanya B., Wyrwa, Jordan M., Casiraghi, Elena, Matentzoglu, Nicolas A., Reese, Justin, Silverstein, Jonathan C., Hoyt, Charles Tapley, Boyce, Richard D., Malec, Scott A., Unni, Deepak R., Joachimiak, Marcin P., Robinson, Peter N., Mungall, Christopher J., Cavalleri, Emanuele, Fontana, Tommaso, Valentini, Giorgio, Mesiti, Marco, Gillenwater, Lucas A., Santangelo, Brook, Vasilevsky, Nicole A., Hoehndorf, Robert, Bennett, Tellen D., Ryan, Patrick B., Hripcsak, George, Kahn, Michael G., Bada, Michael, Baumgartner, William A. Jr, Hunter, Lawrence E.

arXiv.org Artificial IntelligenceJul-11-2023

Translational research requires data at multiple scales of biological organization. Advancements in sequencing and multi-omics technologies have increased the availability of these data but researchers face significant integration challenges. Knowledge graphs (KGs) are used to model complex phenomena, and methods exist to automatically construct them. However, tackling complex biomedical integration problems requires flexibility in the way knowledge is modeled. Moreover, existing KG construction methods provide robust tooling at the cost of fixed or limited choices among knowledge representation models. PheKnowLator (Phenotype Knowledge Translator) is a semantic ecosystem for automating the FAIR (Findable, Accessible, Interoperable, and Reusable) construction of ontologically grounded KGs with fully customizable knowledge representation. The ecosystem includes KG construction resources (e.g., data preparation APIs), analysis tools (e.g., SPARQL endpoints and abstraction algorithms), and benchmarks (e.g., prebuilt KGs and embeddings). We evaluate the ecosystem by surveying open-source KG construction methods and analyzing its computational performance when constructing 12 large-scale KGs. With flexible knowledge representation, PheKnowLator enables fully customizable KGs without compromising performance or usability.

artificial intelligence, expert system, natural language, (15 more...)

arXiv.org Artificial Intelligence

2307.05727

Country:

Europe (0.67)
North America > United States > Colorado > Boulder County > Boulder (0.14)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Technology (0.67)
Government > Regional Government > North America Government > United States Government (0.45)

Technology:

Information Technology > Communications > Web > Semantic Web (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
(2 more...)

Add feedback

GRAPE for Fast and Scalable Graph Processing and random walk-based Embedding

Cappelletti, Luca, Fontana, Tommaso, Casiraghi, Elena, Ravanmehr, Vida, Callahan, Tiffany J., Cano, Carlos, Joachimiak, Marcin P., Mungall, Christopher J., Robinson, Peter N., Reese, Justin, Valentini, Giorgio

arXiv.org Artificial IntelligenceMay-7-2023

Graph Representation Learning (GRL) methods opened new avenues for addressing complex, real-world problems represented by graphs. However, many graphs used in these applications comprise millions of nodes and billions of edges and are beyond the capabilities of current methods and software implementations. We present GRAPE, a software resource for graph processing and embedding that can scale with big graphs by using specialized and smart data structures, algorithms, and a fast parallel implementation of random walk-based methods. Compared with state-of-the-art software resources, GRAPE shows an improvement of orders of magnitude in empirical space and time complexity, as well as a competitive edge and node label prediction performance. GRAPE comprises about 1.7 million well-documented lines of Python and Rust code and provides 69 node embedding methods, 25 inference models, a collection of efficient graph processing utilities and over 80,000 graphs from the literature and other sources. Standardized interfaces allow seamless integration of third-party libraries, while ready-to-use and modular pipelines permit an easy-to-use evaluation of GRL methods, therefore also positioning GRAPE as a software resource to perform a fair comparison between methods and libraries for graph processing and embedding.

data mining, machine learning, programming language, (21 more...)

arXiv.org Artificial Intelligence

2110.06196

Country:

Europe (0.67)
North America > United States > Virginia (0.14)
North America > United States > Texas (0.14)
(3 more...)

Genre: Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area (0.93)
Health & Medicine > Pharmaceuticals & Biotechnology (0.93)
Energy (0.87)
(2 more...)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (1.00)
(4 more...)

Add feedback

Ontologizing Health Systems Data at Scale: Making Translational Discovery a Reality

Callahan, Tiffany J., Stefanski, Adrianne L., Wyrwa, Jordan M., Zeng, Chenjie, Ostropolets, Anna, Banda, Juan M., Baumgartner, William A. Jr., Boyce, Richard D., Casiraghi, Elena, Coleman, Ben D., Collins, Janine H., Deakyne-Davies, Sara J., Feinstein, James A., Haendel, Melissa A., Lin, Asiyah Y., Martin, Blake, Matentzoglu, Nicolas A., Meeker, Daniella, Reese, Justin, Sinclair, Jessica, Taneja, Sanya B., Trinkley, Katy E., Vasilevsky, Nicole A., Williams, Andrew, Zhang, Xingman A., Denny, Joshua C., Robinson, Peter N., Ryan, Patrick, Hripcsak, George, Bennett, Tellen D., Hunter, Lawrence E., Kahn, Michael G.

arXiv.org Artificial IntelligenceJan-30-2023

Background: Common data models solve many challenges of standardizing electronic health record (EHR) data, but are unable to semantically integrate all the resources needed for deep phenotyping. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide computable representations of biological knowledge and enable the integration of heterogeneous data. However, mapping EHR data to OBO ontologies requires significant manual curation and domain expertise. Objective: We introduce OMOP2OBO, an algorithm for mapping Observational Medical Outcomes Partnership (OMOP) vocabularies to OBO ontologies. Results: Using OMOP2OBO, we produced mappings for 92,367 conditions, 8611 drug ingredients, and 10,673 measurement results, which covered 68-99% of concepts used in clinical practice when examined across 24 hospitals. When used to phenotype rare disease patients, the mappings helped systematically identify undiagnosed patients who might benefit from genetic testing. Conclusions: By aligning OMOP vocabularies to OBO ontologies our algorithm presents new opportunities to advance EHR-based deep phenotyping.

artificial intelligence, mapping, ontology, (12 more...)

arXiv.org Artificial Intelligence

2209.04732

Country:

North America > United States > Colorado (0.29)
North America > United States > California (0.28)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.27)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
(6 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)

Add feedback

Developing a Knowledge Graph Framework for Pharmacokinetic Natural Product-Drug Interactions

Taneja, Sanya B., Callahan, Tiffany J., Paine, Mary F., Kane-Gill, Sandra L., Kilicoglu, Halil, Joachimiak, Marcin P., Boyce, Richard D.

arXiv.org Artificial IntelligenceSep-24-2022

Pharmacokinetic natural product-drug interactions (NPDIs) occur when botanical natural products are co-consumed with pharmaceutical drugs. Understanding mechanisms of NPDIs is key to preventing adverse events. We constructed a knowledge graph framework, NP-KG, as a step toward computational discovery of pharmacokinetic NPDIs. NP-KG is a heterogeneous KG with biomedical ontologies, linked data, and full texts of the scientific literature, constructed with the Phenotype Knowledge Translator framework and the semantic relation extraction systems, SemRep and Integrated Network and Dynamic Reasoning Assembler. NP-KG was evaluated with case studies of pharmacokinetic green tea- and kratom-drug interactions through path searches and meta-path discovery to determine congruent and contradictory information compared to ground truth data. The fully integrated NP-KG consisted of 745,512 nodes and 7,249,576 edges. Evaluation of NP-KG resulted in congruent (38.98% for green tea, 50% for kratom), contradictory (15.25% for green tea, 21.43% for kratom), and both congruent and contradictory (15.25% for green tea, 21.43% for kratom) information. Potential pharmacokinetic mechanisms for several purported NPDIs, including the green tea-raloxifene, green tea-nadolol, kratom-midazolam, kratom-quetiapine, and kratom-venlafaxine interactions were congruent with the published literature. NP-KG is the first KG to integrate biomedical ontologies with full texts of the scientific literature focused on natural products. We demonstrate the application of NP-KG to identify pharmacokinetic interactions involving enzymes, transporters, and pharmaceutical drugs. We envision that NP-KG will facilitate improved human-machine collaboration to guide researchers in future studies of pharmacokinetic NPDIs. The NP-KG framework is publicly available at https://doi.org/10.5281/zenodo.6814507 and https://github.com/sanyabt/np-kg.

artificial intelligence, natural language, text processing, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.jbi.2023.104341

2209.1195

Country:

Asia > Middle East > Israel > Mediterranean Sea (0.21)
North America > United States > Washington (0.14)
North America > United States > Missouri > Jackson County > Kansas City (0.14)
(2 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis > Beverages (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Add feedback

Knowledge-based Biomedical Data Science 2019

Callahan, Tiffany J., Pielke-Lombardo, Harrison, Tripodi, Ignacio J., Hunter, Lawrence E.

arXiv.org Artificial IntelligenceOct-8-2019

Knowledge-based biomedical data science (KBDS) involves the design and implementation of computer systems that act as if they knew about biomedicine. Such systems depend on formally represented knowledge in computer systems, often in the form of knowledge graphs. Here we survey the progress in the last year in systems that use formally represented knowledge to address data science problems in both clinical and biological domains, as well as on approaches for creating knowledge graphs. Major themes include the relationships between knowledge graphs and machine learning, the use of natural language processing, and the expansion of knowledge-based approaches to novel domains, such as Chinese Traditional Medicine and biodiversity.

deep learning, knowledge graph, neural network, (18 more...)

arXiv.org Artificial Intelligence

1910.0671

Country: North America > United States > Colorado > Boulder County > Boulder (0.14)

Genre:

Research Report > Promising Solution (1.00)
Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Consumer Health (1.00)
(5 more...)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
(3 more...)

Add feedback