Reasoning Beyond Limits: Advances and Open Problems for LLMs
Mohamed Amine Ferrag, Norbert Tihanyi, Merouane Debbah
arXiv.org Artificial Intelligence
Recent breakthroughs in generative reasoning have transformed how large language models (LLMs) tackle complex problems by dynamically retrieving and refining information while generating coherent, multi-step thought processes. Techniques such as inference-time scaling, reinforcement learning, supervised fine-tuning, and distillation have been successfully applied to models like DeepSeek-R1, OpenAI's o1 and o3, GPT-4o, Qwen-32B, and various Llama variants, resulting in enhanced reasoning capabilities. In this paper, we provide a comprehensive analysis of the top 27 LLMs released between 2023 and 2025 (including Mistral AI Small 3 24B, DeepSeek-R1, Search-o1, QwQ-32B, and phi-4). We then present an extensive overview of training methodologies spanning general training approaches, mixture-of-experts (MoE) and architectural innovations, retrieval-augmented generation (RAG), chain-of-thought and self-improvement techniques, as well as test-time compute scaling, distillation, and reinforcement learning (RL) methods. Finally, we discuss the key challenges in advancing LLM capabilities, including improving multi-step reasoning without human supervision, overcoming limitations in chained tasks, balancing structured prompts with flexibility, and enhancing long-context retrieval and external tool integration.
Mar-26-2025
- Country:
- Africa > Middle East > Algeria > Guelma Province > Guelma (0.04)
- Asia > China > Shanghai > Shanghai (0.04)
- Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.04)
- Asia > Middle East > UAE (0.04)
- Europe > Hungary (0.04)
- North America > United States (0.27)
- Genre:
- Overview (1.00)
- Research Report > New Finding (1.00)
- Research Report > Promising Solution (1.00)
- Industry:
- Education (0.92)
- Energy (0.67)
- Information Technology > Security & Privacy (1.00)
- Technology: