CURE: Confidence-driven Unified Reasoning Ensemble Framework for Medical Question Answering
Elshaer, Ziad, Rashed, Essam A.
–arXiv.org Artificial Intelligence
High-performing medical Large Language Models (LLMs) typically require extensive fine-tuning with substantial computational resources, limiting accessibility for resource-constrained healthcare institutions. This study introduces a confidence-driven multi-model framework that leverages model diversity to enhance medical question answering without fine-tuning. Our framework employs a two-stage architecture: a confidence detection module assesses the primary model's certainty, and an adaptive routing mechanism directs low-confidence queries to Helper models with complementary knowledge for collaborative reasoning. We evaluate our approach using Qwen3-30B-A3B-Instruct, Phi-4 14B, and Gemma 2 12B across three medical benchmarks; MedQA, MedMCQA, and PubMedQA. Result demonstrate that our framework achieves competitive performance, with particularly strong results in PubMedQA (95.0\%) and MedMCQA (78.0\%). Ablation studies confirm that confidence-aware routing combined with multi-model collaboration substantially outperforms single-model approaches and uniform reasoning strategies. This work establishes that strategic model collaboration offers a practical, computationally efficient pathway to improve medical AI systems, with significant implications for democratizing access to advanced medical AI in resource-limited settings.
arXiv.org Artificial Intelligence
Oct-17-2025
- Country:
- Africa > Middle East
- Egypt > Giza Governorate > Giza (0.04)
- Asia > Japan (0.04)
- North America > United States (0.04)
- Africa > Middle East
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: