Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought

Jun-12-2026, 19:01:57 GMT–Neural Information Processing Systems

Large Language Models (LLMs) have demonstrated remarkable performance in many applications, including challenging reasoning problems, via chain-of-thought (CoT) techniques that generate "thinking tokens" before answering a question. While existing theoretical work shows that CoT with discrete tokens boosts the capability of LLMs, recent work on continuous CoT still lacks a theoretical account of why it outperforms its discrete counterpart on various reasoning tasks, such as directed graph reachability, a fundamental graph reasoning problem that includes many practical domain applications as special cases.
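To make the task named in the abstract concrete, the following is a minimal sketch of directed graph reachability: given a set of directed edges, decide whether a target node can be reached from a source node. The function name `is_reachable`, the edge-list input format, and the use of breadth-first search are illustrative assumptions; this is a classical baseline for the problem, not the method analyzed in the paper.

```python
from collections import deque


def is_reachable(edges, source, target):
    """Return True if `target` can be reached from `source` along directed edges.

    `edges` is an iterable of (u, v) pairs meaning "u points to v".
    This is a hypothetical helper that uses plain breadth-first search.
    """
    # Build an adjacency list: node -> list of direct successors.
    adjacency = {}
    for u, v in edges:
        adjacency.setdefault(u, []).append(v)

    visited = {source}
    frontier = deque([source])
    while frontier:
        node = frontier.popleft()
        if node == target:
            return True
        for successor in adjacency.get(node, []):
            if successor not in visited:
                visited.add(successor)
                frontier.append(successor)
    return False


# Example graph (made up for illustration): d -> a -> b -> c
example_edges = [("a", "b"), ("b", "c"), ("d", "a")]
print(is_reachable(example_edges, "a", "c"))
print(is_reachable(example_edges, "c", "a"))
```

Because the edges are directed, the answer depends on direction: in the example graph, `c` is reachable from `a`, but `a` is not reachable from `c`.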

artificial intelligence, large language model, natural language

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.85)