Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery
Xu, Licong, Sarkar, Milind, Lonappan, Anto I., Zubeldia, Íñigo, Villanueva-Domingo, Pablo, Casas, Santiago, Fidler, Christian, Amancharla, Chetana, Tiwari, Ujjwal, Bayer, Adrian, Ekioui, Chadi Ait, Cranmer, Miles, Dimitrov, Adrian, Fergusson, James, Gandhi, Kahaan, Krippendorf, Sven, Laverick, Andrew, Lesgourgues, Julien, Lewis, Antony, Meier, Thomas, Sherwin, Blake, Surrao, Kristen, Villaescusa-Navarro, Francisco, Wang, Chi, Xu, Xueqing, Bolliet, Boris
–arXiv.org Artificial Intelligence
We present a multi-agent system for automation of scientific research tasks, cmbagent (https://github.com/CMBAgents/cmbagent). The system is formed by about 30 Large Language Model (LLM) agents and implements a Planning & Control strategy to orchestrate the agentic workflow, with no human-in-the-loop at any point. Each agent specializes in a different task (performing retrieval on scientific papers and codebases, writing code, interpreting results, critiquing the output of other agents) and the system is able to execute code locally. We successfully apply cmbagent to carry out a PhD level cosmology task (the measurement of cosmological parameters using supernova data) and evaluate its performance on two benchmark sets, finding superior performance over state-of-the-art LLMs. The source code is available on GitHub, demonstration videos are also available, and the system is deployed on HuggingFace and will be available on the cloud.
arXiv.org Artificial Intelligence
Jul-14-2025
- Country:
- Asia > India
- Punjab (0.04)
- Europe
- France (0.04)
- Germany
- Bavaria > Upper Bavaria
- Munich (0.05)
- North Rhine-Westphalia > Cologne Region
- Aachen (0.04)
- Bavaria > Upper Bavaria
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.15)
- North America
- Canada (0.04)
- United States
- California
- Los Angeles County > Pasadena (0.04)
- San Diego County > San Diego (0.04)
- New Jersey > Mercer County
- Princeton (0.04)
- California
- South America > Chile
- Asia > India
- Genre:
- Research Report (0.50)
- Workflow (0.68)
- Technology: