ACR: A Benchmark for Automatic Cohort Retrieval

Thai, Dung Ngoc, Ardulov, Victor, Mena, Jose Ulises, Tiwari, Simran, Erofeev, Gleb, Eskander, Ramy, Tarabishy, Karim, Parikh, Ravi B, Salloum, Wael

Jul-1-2024–arXiv.org Artificial Intelligence

Identifying patient cohorts is fundamental to numerous healthcare tasks, including clinical trial recruitment and retrospective studies. Current cohort retrieval methods in healthcare organizations rely on automated queries of structured data combined with manual curation, which are time-consuming, labor-intensive, and often yield low-quality results. Recent advancements in large language models (LLMs) and information retrieval (IR) offer promising avenues to revolutionize these systems. Major challenges include managing extensive eligibility criteria and handling the longitudinal nature of unstructured Electronic Medical Records (EMRs) while ensuring that the solution remains cost-effective for real-world application. This paper introduces a new task, Automatic Cohort Retrieval (ACR), and evaluates the performance of LLMs and commercial, domain-specific neuro-symbolic approaches. We provide a benchmark task, a query dataset, an EMR dataset, and an evaluation framework. Our findings underscore the necessity for efficient, high-quality ACR systems capable of longitudinal reasoning across extensive patient databases.

cohort, query, reasoning, (12 more...)

arXiv.org Artificial Intelligence

Jul-1-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania (0.04)
  - Alaska (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Health & Medicine
  - Pharmaceuticals & Biotechnology (1.00)
  - Health Care Technology > Medical Record (1.00)
  - Health Care Providers & Services (1.00)
  - Therapeutic Area
    - Oncology > Carcinoma (1.00)
    - Obstetrics/Gynecology (0.95)
    - Dermatology (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (0.70)
    - Information Retrieval (0.67)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found