Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
Zhang, Kexun, Yao, Weiran, Liu, Zuxin, Feng, Yihao, Liu, Zhiwei, Murthy, Rithesh, Lan, Tian, Li, Lei, Lou, Renze, Xu, Jiacheng, Pang, Bo, Zhou, Yingbo, Heinecke, Shelby, Savarese, Silvio, Wang, Huan, Xiong, Caiming
–arXiv.org Artificial Intelligence
Large language model (LLM) agents have shown great potential in solving realworld software engineering (SWE) problems. The most advanced open-source SWE agent can resolve over 27% of real GitHub issues in SWE-Bench Lite. To fully harness the diversity of these agents, we propose DEI (Diversity Empowered Intelligence), a framework that leverages their unique expertise. DEI functions as a meta-module atop existing SWE agent frameworks, managing agent collectives for enhanced problemsolving. Experimental results show that a DEI-guided committee of agents is able to surpass the best individual agent's performance by a large margin. For instance, a group of open-source SWE agents, with a maximum individual resolve rate of 27.3% on SWE-Bench Lite, can achieve a 34.3% resolve rate with DEI, making a 25% improvement and beating most closed-source solutions. Our findings contribute to the growing body of research on collaborative AI systems and their potential to solve complex software engineering challenges. Recent advancements in large language models (LLMs) have transformed software engineering (SWE) and other domains.
arXiv.org Artificial Intelligence
Aug-13-2024
- Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Information Technology (0.68)
- Technology: