Collaboration among Multiple Large Language Models for Medical Question Answering
Shang, Kexin, Chang, Chia-Hsuan, Yang, Christopher C.
–arXiv.org Artificial Intelligence
Empowered by vast internal knowledge reservoir, the new generation of large language models (LLMs) demonstrate untapped potential to tackle medical tasks. However, there is insufficient effort made towards summoning up a synergic effect from multiple LLMs' expertise and background. In this study, we propose a multi-LLM collaboration framework tailored on a medical multiple-choice questions dataset. Through post-hoc analysis on 3 pre-trained LLM participants, our framework is proved to boost all LLMs reasoning ability as well as alleviate their divergence among questions. We also measure an LLM's confidence when it confronts with adversary opinions from other LLMs and observe a concurrence between LLM's confidence and prediction accuracy.
arXiv.org Artificial Intelligence
May-23-2025
- Country:
- Asia > Thailand
- North America
- Canada > Ontario
- Toronto (0.04)
- United States (0.68)
- Canada > Ontario
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Government > Regional Government (0.46)
- Health & Medicine (1.00)
- Technology: