Who Reasons in the Large Language Models?

Jun-13-2026, 15:50:53 GMT–Neural Information Processing Systems

Despite the impressive performance of large language models (LLMs), the process of endowing them with new capabilities---such as mathematical reasoning---remains largely empirical and opaque. A critical open question is whether reasoning abilities stem from the entire model, specific modules, or are merely artifacts of overfitting. In this work, we hypothesize that the reasoning capabilities in well-trained LLMs are primarily attributed to the output projection module (o proj plays a central role in enabling reasoning, whereas other modules contribute more to fluent dialogue. These findings offer a new perspective on LLM interpretability and open avenues for more targeted training strategies, potentially enabling more efficient and specialized LLMs.

artificial intelligence, large language model, natural language, (11 more...)

Neural Information Processing Systems

Jun-13-2026, 15:50:53 GMT

Conferences Web Page

Add feedback

Industry:
- Education (0.63)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)