Collaborating Authors

 Wilbur, W. John


Ensuring Safety and Trust: Analyzing the Risks of Large Language Models in Medicine

arXiv.org Artificial Intelligence

The remarkable capabilities of Large Language Models (LLMs) make them increasingly compelling for adoption in real-world healthcare applications. However, the risks associated with using LLMs in medical applications have not been systematically characterized. We propose five key principles for safe and trustworthy medical AI (Truthfulness, Resilience, Fairness, Robustness, and Privacy), along with ten specific aspects. Under this comprehensive framework, we introduce a novel MedGuard benchmark with 1,000 expert-verified questions. Our evaluation of 11 commonly used LLMs shows that current language models, regardless of their safety alignment mechanisms, generally perform poorly on most of our benchmarks, particularly when compared to the high performance of human physicians. Although recent reports indicate that advanced LLMs like ChatGPT can match or even exceed human performance in various medical tasks, this study underscores a significant safety gap, highlighting the crucial need for human oversight and the implementation of AI safety guardrails.
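As a rough illustration of how an evaluation over such a benchmark can be organized, the sketch below scores a model's answers per safety aspect and reports per-aspect accuracy. The question schema, field names, and exact-match scoring are assumptions made for illustration, not the released MedGuard format or evaluation code.

```python
from collections import defaultdict

def score_by_aspect(questions, answer_fn):
    """Per-aspect accuracy on a MedGuard-style benchmark.

    questions: iterable of dicts with 'aspect', 'prompt', and 'answer'
    keys (an illustrative schema, not the released MedGuard format).
    answer_fn: callable mapping a prompt string to the model's answer.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for q in questions:
        total[q["aspect"]] += 1
        # Exact-match scoring is a simplifying assumption here.
        if answer_fn(q["prompt"]).strip() == q["answer"]:
            correct[q["aspect"]] += 1
    return {aspect: correct[aspect] / total[aspect] for aspect in total}
```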


MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval

arXiv.org Artificial Intelligence

Information retrieval (IR) is essential in biomedical knowledge acquisition and clinical decision support. While recent progress has shown that language model encoders achieve better semantic retrieval, training such models requires abundant query-article annotations that are difficult to obtain in biomedicine. As a result, most biomedical IR systems only conduct lexical matching. In response, we introduce MedCPT, a first-of-its-kind Contrastively Pre-trained Transformer model for zero-shot semantic IR in biomedicine. To train MedCPT, we collected an unprecedented scale of 255 million user click logs from PubMed. With such data, we use contrastive learning to train a closely integrated retriever and re-ranker pair. Experimental results show that MedCPT sets new state-of-the-art performance on six biomedical IR tasks, outperforming various baselines including much larger models such as the GPT-3-sized cpt-text-XL. In addition, MedCPT also generates better biomedical article and sentence representations for semantic evaluations. As such, MedCPT can be readily applied to various real-world biomedical IR tasks.
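A minimal sketch of the kind of contrastive objective used to train a dense retriever from query-article click pairs: an InfoNCE-style loss with in-batch negatives. The loss form, temperature value, and normalization are common choices assumed here for illustration, not the released MedCPT training code.

```python
import torch
import torch.nn.functional as F

def in_batch_contrastive_loss(query_emb, article_emb, temperature=0.05):
    """InfoNCE-style contrastive loss with in-batch negatives.

    query_emb, article_emb: (batch, dim) tensors where row i of each
    tensor comes from the same click pair (query i -> clicked article i).
    Every other article in the batch serves as a negative for query i.
    """
    query_emb = F.normalize(query_emb, dim=-1)
    article_emb = F.normalize(article_emb, dim=-1)
    # Similarity matrix: entry (i, j) scores query i against article j.
    logits = query_emb @ article_emb.T / temperature
    # The clicked article for query i sits on the diagonal.
    targets = torch.arange(logits.size(0), device=logits.device)
    return F.cross_entropy(logits, targets)
```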


Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records

arXiv.org Machine Learning

Capturing sentence semantics plays a vital role in a range of text mining applications. Despite continuous efforts on the development of related datasets and models in the general domain, both datasets and models remain limited in the biomedical and clinical domains. The BioCreative/OHNLP organizers made the first attempt to annotate 1,068 sentence pairs from clinical notes and called for a community effort to tackle the Semantic Textual Similarity (BioCreative/OHNLP STS) challenge. We developed models using traditional machine learning and deep learning approaches. For the post challenge, we focused on two models: the Random Forest and the Encoder Network. We applied sentence embeddings pre-trained on PubMed abstracts and MIMIC-III clinical notes and updated the Random Forest and the Encoder Network accordingly. The official results showed that our best submission was an ensemble of eight models. It achieved a Pearson correlation coefficient of 0.8328, the highest performance among 13 submissions from 4 teams. For the post challenge, the performance of both the Random Forest and the Encoder Network improved; in particular, the correlation of the Encoder Network improved by ~13%. During the challenge task, no end-to-end deep learning model outperformed the machine learning models built on manually crafted features. In contrast, with sentence embeddings pre-trained on biomedical corpora, the Encoder Network now achieves a correlation of ~0.84, higher than the original best model. An ensemble taking the improved versions of the Random Forest and the Encoder Network as inputs further increased performance to 0.8528. Deep learning models with sentence embeddings pre-trained on biomedical corpora achieve the highest performance on the test set.
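The sketch below illustrates the basic evaluation loop behind such STS systems: score each sentence pair from pre-trained embeddings and compare predictions to the gold annotations with the Pearson correlation. Cosine similarity as the pair score and the callable embedding interface are illustrative assumptions; the paper's actual models combine embeddings with learned regressors.

```python
import numpy as np
from scipy.stats import pearsonr

def cosine_similarity(a, b):
    """Cosine similarity between two sentence embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def evaluate_sts(embed, sentence_pairs, gold_scores):
    """Score sentence pairs and report Pearson correlation with gold labels.

    embed: callable mapping a sentence string to a 1-D numpy vector
    (e.g., an encoder pre-trained on PubMed abstracts or MIMIC-III notes).
    sentence_pairs: list of (sentence1, sentence2) tuples.
    gold_scores: list of human-annotated similarity scores.
    """
    predictions = [cosine_similarity(embed(s1), embed(s2))
                   for s1, s2 in sentence_pairs]
    correlation, _ = pearsonr(predictions, gold_scores)
    return correlation
```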


Automatic Identification of Key Concepts in Large PubMed Retrievals

AAAI Conferences

PubMed queries frequently retrieve thousands of documents, making it very challenging for a user to identify information of interest. In this paper we propose a method for automatically identifying central concepts in large PubMed retrievals. The centrality of a concept is modeled using the hypergeometric distribution. Retrieved documents are grouped by concept, which can help users navigate the retrieval. We test our method on five datasets, each representing a medical condition.
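A minimal sketch of how a hypergeometric model can flag over-represented concepts in a retrieval: compute the tail probability of observing a concept in at least as many retrieved documents as it actually appears in, given its overall frequency in PubMed. The exact scoring in the paper may differ; this is only the standard enrichment-style calculation.

```python
from scipy.stats import hypergeom

def concept_centrality_pvalue(k, n, K, N):
    """Hypergeometric tail probability of a concept's retrieval frequency.

    k: retrieved documents annotated with the concept
    n: total retrieved documents
    K: documents in all of PubMed annotated with the concept
    N: total documents in PubMed
    A small p-value indicates the concept is over-represented in the
    retrieval relative to chance, i.e., a candidate key concept.
    """
    # P(X >= k) for X ~ Hypergeometric(population N, successes K, draws n)
    return hypergeom.sf(k - 1, N, K, n)
```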


PROBE: Periodic Random Orbiter Algorithm for Machine Learning

AAAI Conferences

We present a new algorithm, which we call PROBE, to find the minimum of a convex function. Such a minimization is important in many machine learning methods, including Support Vector Machines (SVMs). We show that PROBE is a viable alternative to published algorithms for SVM learning, with several important advantages. PROBE is a simple and easily programmed algorithm with a well-defined, parametrized stopping criterion; it is not limited to SVMs, but can be applied to other convex loss functions, such as the Huber and Maximum Entropy models; and its time and memory requirements are consistently modest in handling very large training sets.
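The abstract does not spell out the PROBE update rule, so the sketch below only illustrates the kind of convex objective such a solver targets: an L2-regularized hinge loss for a linear SVM, minimized here with plain subgradient descent as a generic stand-in, not as the PROBE algorithm itself.

```python
import numpy as np

def svm_objective(w, X, y, C=1.0):
    """L2-regularized hinge loss: the convex function an SVM solver minimizes.
    X: (n_samples, n_features); y: labels in {-1, +1}."""
    margins = 1.0 - y * (X @ w)
    return 0.5 * np.dot(w, w) + C * np.sum(np.maximum(0.0, margins))

def subgradient_descent(X, y, C=1.0, steps=1000, lr=1e-3):
    """Generic subgradient descent on the SVM objective (a stand-in
    optimizer for illustration, not the PROBE update rule)."""
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        margins = 1.0 - y * (X @ w)
        active = margins > 0
        # Subgradient of the regularizer plus the active hinge terms.
        grad = w - C * (y[active, None] * X[active]).sum(axis=0)
        w -= lr * grad
        # A real stopping rule would monitor the objective decrease here.
    return w
```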