AITopics | Problem Solving

Collaborating Authors

Problem Solving

News Overviews Instructional Materials AI-Alerts Classics

Long Chain-of-Thought Reasoning Across Languages

Barua, Josh, Eisape, Seun, Yin, Kayo, Suhr, Alane

arXiv.org Artificial IntelligenceOct-10-2025

While large reasoning models have shown remarkable ability to generate long chains-of-thought (CoTs) in English, we still lack understanding of how these long-form reasoning abilities transfer to the vast majority of the world's languages. In this work, we systematically investigate four key stages of model development--scaling, pretraining, post-training, and inference--to understand how long CoT capabilities extend beyond English. We compare two reasoning settings across nine non-English target languages: En-CoT, where models process target-language inputs, but reason in English; and Target-CoT, where models both process inputs and generate long CoTs in the target language. We find that scaling reasoning model size improves multilingual task performance in En-CoT, but Target-CoT performance lags behind. This gap widens for tasks requiring long, multi-step CoTs such as mathematical reasoning. Shifting to pretraining, we find that adding a specialized reasoning stage enhances En-CoT performance but degrades Target-CoT, whereas broad multilingual pretraining improves both modes simultaneously. Given the scarcity of high-quality reasoning traces in languages other than English, we explore synthetic data curation approaches for post-training. We demonstrate that fine-tuning on reasoning traces automatically translated from gold English traces outperforms fine-tuning on target-language traces distilled from large reasoning models. Finally, we report disparities in inference efficiency between languages and uncover language-specific failure modes in CoTs. We release models, datasets, and code to foster further research.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2508.14828

Country:

Africa (0.46)
Asia (0.28)
Europe (0.28)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

Play to Generalize: Learning to Reason Through Game Play

Xie, Yunfei, Ma, Yinsong, Lan, Shiyi, Yuille, Alan, Xiao, Junfei, Wei, Chen

arXiv.org Artificial IntelligenceOct-10-2025

Developing reasoning capabilities in multimodal large language models (MLLMs) remains challenging. Motivated by literature suggesting that gameplay promotes transferable reasoning skills, we propose a novel post-training method, Visual Game Learning (ViGaL), where MLLMs develop generalizable reasoning skills through playing arcade-like games. Specifically, we show that training a 7B-parameter MLLM via reinforcement learning (RL) on simple games like Snake significantly enhances the downstream performance on multimodal math benchmarks like MathVista, on multi-discipline questions like MMMU and on 3D spatial reasoning benchmarks like VSI-Bench, without seeing any worked solutions, equations, or diagrams during RL. Remarkably, our model outperforms specialist models post-trained on benchmark-oriented multimodal reasoning data, while preserving the model's performance on general visual benchmarks, a challenge where specialist models often fall short. Our findings suggest that multimodal reasoning can emerge from gameplay, pointing to a promising strategy of designing surrogate tasks for RL post-training.

large language model, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2506.08011

Genre: Research Report > New Finding (1.00)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
(3 more...)

Add feedback

Chain of Thoughtlessness An Analysis of CoT in Planning

Neural Information Processing SystemsOct-9-2025, 22:48:56 GMT

Previous work has claimed that this can be mitigated with chain of thought prompting-a method of demonstrating solution procedures-with the intuition that it is possible to in-context teach an LLM an algorithm for solving the problem.

arxiv preprint arxiv, current state, expression, (14 more...)

Neural Information Processing Systems

Country: North America > United States > Arizona (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.93)

Add feedback

AlphaMath Almost Zero: Process Supervision without Process

Neural Information Processing SystemsOct-9-2025, 22:34:38 GMT

LLMs possess a vast reservoir of knowledge, which remains under-utilized in current finetuning-based approaches.

arxiv preprint arxiv, dataset, value model, (14 more...)

Neural Information Processing Systems

Country: Asia > Thailand > Bangkok > Bangkok (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)
Workflow (0.92)

Industry:

Leisure & Entertainment > Games (0.93)
Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
(2 more...)

Add feedback

222d2eaf24cf8259a35d6c7130d31425-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsOct-9-2025, 20:54:00 GMT

arxiv preprint arxiv, benchmark, reasoning ability, (14 more...)

Neural Information Processing Systems

Country:

Asia > China > Beijing > Beijing (0.04)
Asia > China > Shanghai > Shanghai (0.04)
Oceania > New Zealand (0.04)
(3 more...)

Genre: Research Report (0.92)

Industry:

Health & Medicine (0.68)
Education > Curriculum > Subject-Specific Education (0.67)
Education > Educational Setting (0.46)
Energy > Renewable (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Software > Programming Languages (0.92)
(2 more...)

Add feedback

Goal Reduction with Loop-Removal Accelerates RL and Models Human Brain Activity in Goal-Directed Learning

Neural Information Processing SystemsOct-9-2025, 17:36:03 GMT

The pressure for survival prohibits slow, linear adaptation to different goals, i.e., learning value functions from scratch for each new objective.

agent, experiment, subgoal, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Indiana (0.04)
Europe > Finland > Uusimaa > Helsinki (0.04)
Asia > China (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.65)

Add feedback

01025a4e79355bb37a10ba39605944b5-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 16:57:26 GMT

dataset, language model, rationale, (15 more...)

Neural Information Processing Systems

Country:

Europe > Italy (0.04)
Asia > India (0.04)
Indian Ocean > Arabian Gulf (0.04)
(5 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Media > Film (0.68)
Leisure & Entertainment (0.68)
Education > Curriculum > Subject-Specific Education (0.67)
Education > Educational Setting (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
(2 more...)

Add feedback

Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs

Neural Information Processing SystemsOct-9-2025, 16:57:15 GMT

This deliberation, however, comes at the cost of significantly increased inference complexity.

arxiv preprint arxiv, language model, reasoning path, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Maine (0.04)
Asia > Singapore (0.04)
North America > Canada > Ontario > Toronto (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry:

Government (0.47)
Information Technology (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Reports of the Workshops Held at the 2025 AAAI Conference on Artificial Intelligence

Interactive AI MagazineOct-9-2025, 15:55:18 GMT

The Workshop Program of the Association for the Advancement of Artificial Intelligence's 39th Conference on Artificial Intelligence (AAAI-25) was held in Philadelphia, Pennsylvania, on February 25 - March 4, 2025. TIKA is envisioned to create an open knowledge resource and serve as a hub for research, education and training on knowledge representation and knowledge engineering. Over 50 AI researchers convened at the workshop over two days. The discussions focused on different aspects of creating an open knowledge resource including foundational knowledge, automated reasoning, knowledge curation, education on knowledge axiomatization, and evaluation of outcomes. The opening discussion confirmed that the idea of curated knowledge, that is, knowledge captured in an expressive formal language that can be explicitly examined and verified by humans, is compelling. It must, however, be situated in the modern context of AI. Such a resource should address the limitations of existing generative ...

application, university, workshop, (13 more...)

Interactive AI Magazine

Country: