Accelerating Chain of Thought Reasoning through Semantically Aligned Implicit Tokens

Jun-16-2026, 16:17:28 GMT–Neural Information Processing Systems

Chain-of-Thought (CoT) enhances the performance of Large Language Models (LLMs) on reasoning tasks by encouraging step-by-step solutions. However, the verbosity of CoT reasoning hinders its mass deployment in efficiency-critical applications. Recently, implicit CoT approaches have emerged, which encode reasoning steps within LLM's hidden embeddings (termed "implicit reasoning") rather than explicit tokens. This approach accelerates CoT reasoning by reducing the reasoning length and bypassing some LLM components. However, existing implicit CoT methods face two significant challenges: (1) they fail to preserve the semantic alignment between the implicit reasoning (when transformed to natural language) and the ground-truth reasoning, resulting in a significant CoT performance degradation, and (2) they focus on reducing the length of the implicit reasoning; however, they neglect the considerable time cost for an LLM to generate one individual implicit reasoning token.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Jun-16-2026, 16:17:28 GMT

Conferences PDF

Add feedback

Country:
- Europe (0.68)
- North America > United States
  - California (0.28)
  - Virginia > Albemarle County
    - Charlottesville (0.14)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Education (0.68)
- Information Technology (0.46)
- Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found