Analyzing the Power of Chain of Thought through Memorization Capabilities
–Neural Information Processing Systems
It has been shown that the chain of thought (CoT) can enhance the power of large language models (LLMs) to solve certain mathematical reasoning problems. However, the capacity of CoT is still not fully explored. As an important instance, the following basic question has not yet been answered: Does CoT expand the capability of transformers across all reasoning tasks? We demonstrate that reasoning with transformers is essentially a memorization problem for reasoning datasets.
Neural Information Processing Systems
Jun-16-2026, 20:46:21 GMT