Doctor-R1: Mastering Clinical Inquiry with Experiential Agentic Reinforcement Learning

Lai, Yunghwei, Liu, Kaiming, Wang, Ziyue, Ma, Weizhi, Liu, Yang

Oct-7-2025–arXiv.org Artificial Intelligence

The professionalism of a human doctor in outpatient service depends on two core abilities: the ability to make accurate medical decisions and the medical consultation skill to conduct strategic, empathetic patient inquiry. Existing Large Language Models (LLMs) have achieved remarkable accuracy on medical decision-making benchmarks. However, they often lack the ability to conduct the strategic and empathetic consultation, which is essential for real-world clinical scenarios. To address this gap, we propose Doctor-R1, an AI doctor agent trained to master both of the capabilities by ask high-yield questions and conduct strategic multi-turn inquiry to guide decision-making. Our framework introduces three key components: a multi-agent interactive environment, a two-tiered reward architecture that separately optimizes clinical decision-making and communicative inquiry skills, and an experience repository to ground policy learning in high-quality prior trajectories. We evaluate Doctor-R1 on OpenAI's HealthBench and MAQuE, assessed across multi-facet metrics, such as communication quality, user experience, and task accuracy. Remarkably, Doctor-R1 surpasses state-of-the-art open-source specialized LLMs by a substantial margin with higher parameter efficiency and outperforms powerful proprietary models. Furthermore, the human evaluations show a strong preference for Doctor-R1 to generate human-preferred clinical dialogue, demonstrating the effectiveness of the framework.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Oct-7-2025

arXiv.org PDF

Add feedback

Country:
- Europe > Austria (0.28)
- Asia > Middle East
  - UAE (0.28)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Health & Medicine
  - Health Care Providers & Services (1.00)
  - Diagnostic Medicine (1.00)
  - Pharmaceuticals & Biotechnology (0.93)
  - Therapeutic Area
    - Pulmonary/Respiratory Diseases (1.00)
    - Infections and Infectious Diseases (1.00)
    - Immunology (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Natural Language > Large Language Model (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (1.00)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found