Assessing and Enhancing the Robustness of LLM-based Multi-Agent Systems Through Chaos Engineering

May-7-2025–arXiv.org Artificial Intelligence

--This study explores the application of chaos engineering to enhance the robustness of Large Language Model-Based Multi-Agent Systems (LLM-MAS) in production-like environments under real-world conditions. LLM-MAS can potentially improve a wide range of tasks, from answering questions and generating content to automating customer support and improving decision-making processes. However, LLM-MAS in production or preproduction environments can be vulnerable to emergent errors or disruptions, such as hallucinations, agent failures, and agent communication failures. This study proposes a chaos engineering framework to proactively identify such vulnerabilities in LLM-MAS, assess and build resilience against them, and ensure reliable performance in critical applications. I NTRODUCTION Large Language Models (LLMs) such as Bing [1], Gemini [2], and ChatGPT [3] have transformed natural language processing (NLP) through innovations such as transformer architectures [4] and large-scale pretraining [5].

engineering, large language model, machine learning, (14 more...)

arXiv.org Artificial Intelligence

May-7-2025

arXiv.org PDF

Add feedback

Country:
- Europe > Netherlands (0.14)

Genre:
- Research Report (0.86)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found