SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought
–Neural Information Processing Systems
Chain-of-Thought (CoT) prompting improves the reasoning performance of large language models (LLMs) by encouraging step-by-step thinking. However, CoT-based methods depend on intermediate reasoning steps, which limits scalability and generalization. Recent work explores recursive reasoning, where LLMs reuse internal layers across iterations to refine latent representations without explicit CoT supervision. While promising, these approaches often require costly pretraining and lack a principled framework for how reasoning should evolve across iterations.
Neural Information Processing Systems
Jun-13-2026, 04:29:53 GMT
- Technology: