Interactive Debugging and Steering of Multi-Agent AI Systems
Will Epperson, Gagan Bansal, Victor Dibia, Adam Fourney, Jack Gerrits, Erkang Zhu, Saleema Amershi
arXiv.org Artificial Intelligence
Fully autonomous teams of LLM-powered AI agents are emerging that collaborate to perform complex tasks for users. What challenges do developers face when trying to build and debug these AI agent teams? In formative interviews with five AI agent developers, we identify core challenges: difficulty reviewing long agent conversations to localize errors, lack of support in current tools for interactive debugging, and the need for tool support to iterate on agent configuration. Based on these needs, we developed an interactive multi-agent debugging tool, AGDebugger, with a UI for browsing and sending messages, the ability to edit and reset prior agent messages, and an overview visualization for navigating complex message histories. In a two-part user study with 14 participants, we identify common user strategies for steering agents and highlight the importance of interactive message resets for debugging. Our studies deepen understanding of interfaces for debugging increasingly important agentic workflows.
Mar-3-2025
- Genre:
- Research Report > New Finding (1.00)
- Questionnaire & Opinion Survey (1.00)
- Personal > Interview (1.00)
- Industry:
- Information Technology (0.46)
- Technology: