Let LRMs Break Free from Overthinking via Self-Braking Tuning
–Neural Information Processing Systems
Large reasoning models (LRMs), such as OpenAI o1 and DeepSeek-R1, have significantly enhanced their reasoning capabilities by generating longer chains of thought, demonstrating outstanding performance across a variety of tasks. However, this performance gain comes at the cost of a substantial increase in redundant reasoning during the generation process, leading to high computational overhead and exacerbating the issue of overthinking. Although numerous existing approaches aim to address the problem of overthinking, they often rely on external interventions.
Neural Information Processing Systems
Jun-9-2026, 15:45:37 GMT
- Technology: