Voting or Consensus? Decision-Making in Multi-Agent Debate
Kaesberg, Lars Benedikt, Becker, Jonas, Wahle, Jan Philip, Ruas, Terry, Gipp, Bela
–arXiv.org Artificial Intelligence
Much of the success of multi-agent debates depends on carefully choosing the right parameters. Among them, the decision-making protocol stands out. Systematic comparison of decision protocols is difficult because studies alter multiple discussion parameters beyond the protocol. So far, it has been largely unknown how decision-making addresses the challenges of different tasks. This work systematically evaluates the impact of seven decision protocols (e.g., majority voting, unanimity consensus). We change only one variable at a time (i.e., decision protocol) to analyze how different methods affect the collaboration between agents and test different protocols on knowledge (MMLU, MMLU-Pro, GPQA) and reasoning datasets (StrategyQA, MuSR, SQuAD 2.0). Our results show that voting protocols improve performance by 13.2% in reasoning tasks and consensus protocols by 2.8% in knowledge tasks over the other decision protocol. Increasing the number of agents improves performance, while more discussion rounds before voting reduces it. To improve decision-making by increasing answer diversity, we propose two new methods, All-Agents Drafting (AAD) and Collective Improvement (CI). Our methods improve task performance by up to 3.3% with AAD and up to 7.4% with CI. This work demonstrates the importance of decision-making in multi-agent debates beyond scaling.
arXiv.org Artificial Intelligence
Feb-26-2025
- Country:
- Africa > Rwanda (0.14)
- Europe > Germany
- Lower Saxony > Gottingen (0.14)
- North America > United States (0.14)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.94)
- Leisure & Entertainment (0.68)
- Media > Music (0.68)
- Technology: