From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Ferrag, Mohamed Amine, Tihanyi, Norbert, Debbah, Merouane
–arXiv.org Artificial Intelligence
Large language models and autonomous AI agents have evolved rapidly, resulting in a diverse array of evaluation benchmarks, frameworks, and collaboration protocols. However, the landscape remains fragmented and lacks a unified taxonomy or comprehensive survey. Therefore, we present a side-by-side comparison of benchmarks developed between 2019 and 2025 that evaluate these models and agents across multiple domains. In addition, we propose a taxonomy of approximately 60 benchmarks that cover general and academic knowledge reasoning, mathematical problem-solving, code generation and software engineering, factual grounding and retrieval, domain-specific evaluations, multimodal and embodied tasks, task orchestration, and interactive assessments. Furthermore, we review AI-agent frameworks introduced between 2023 and 2025 that integrate large language models with modular toolkits to enable autonomous decision-making and multi-step reasoning. Moreover, we present real-world applications of autonomous AI agents in materials science, biomedical research, academic ideation, software engineering, synthetic data generation, chemical reasoning, mathematical problem-solving, geographic information systems, multimedia, healthcare, and finance. We then survey key agent-to-agent collaboration protocols, namely the Agent Communication Protocol (ACP), the Model Context Protocol (MCP), and the Agent-to-Agent Protocol (A2A). Finally, we discuss recommendations for future research, focusing on advanced reasoning strategies, failure modes in multi-agent LLM systems, automated scientific discovery, dynamic tool integration via reinforcement learning, integrated search capabilities, and security vulnerabilities in agent protocols.
arXiv.org Artificial Intelligence
Apr-29-2025
- Country:
- Africa > Middle East
- Algeria > Guelma Province > Guelma (0.04)
- Asia
- China (0.04)
- Middle East
- Syria > Daraa Governorate
- Dar'a (0.04)
- UAE (0.04)
- Syria > Daraa Governorate
- Europe > Hungary (0.04)
- North America > United States (0.45)
- Africa > Middle East
- Genre:
- Overview (1.00)
- Research Report
- New Finding (1.00)
- Promising Solution (1.00)
- Workflow (1.00)
- Industry:
- Banking & Finance > Trading (1.00)
- Education
- Government > Regional Government
- Health & Medicine
- Diagnostic Medicine (1.00)
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area
- Cardiology/Vascular Diseases (0.67)
- Oncology (0.92)
- Information Technology > Security & Privacy (1.00)
- Law (0.92)
- Leisure & Entertainment > Games (1.00)
- Media (1.00)
- Technology: