AITopics | Problem Solving

Collaborating Authors

Problem Solving

News Overviews Instructional Materials AI-Alerts Classics

Learning World Models for Unconstrained Goal Navigation

Neural Information Processing SystemsOct-10-2025, 05:23:18 GMT

Learning world models offers a promising avenue for goal-conditioned reinforcement learning with sparse rewards. By allowing agents to plan actions or exploratory goals without direct interaction with the environment, world models enhance exploration efficiency. The quality of a world model hinges on the richness of data stored in the agent's replay buffer, with expectations of reasonable

subgoal, trajectory, world model, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Montana (0.04)
North America > United States > California > Orange County > Irvine (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Add feedback

Causal language modeling can elicit search and reasoning capabilities on logic puzzles

Neural Information Processing SystemsOct-10-2025, 04:49:57 GMT

Causal language modeling using the Transformer architecture has yielded remarkable capabilities in Large Language Models (LLMs) over the last few years.

accuracy, puzzle, sudoku puzzle, (14 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.93)

Industry: Leisure & Entertainment > Games (0.51)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Add feedback

62ab1c2cb4b03e717005479efb211841-Paper-Conference.pdf

Neural Information Processing SystemsOct-10-2025, 04:28:21 GMT

combination law, language model, reasoning boundary, (12 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
Asia > China > Hunan Province (0.04)
North America > Canada > Ontario > Toronto (0.04)
(6 more...)

Genre:

Research Report > Experimental Study (0.93)
Overview (0.67)
Research Report > New Finding (0.67)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

5ddcfaad1cb72ce6f1a365e8f1ecf791-Paper-Conference.pdf

Neural Information Processing SystemsOct-10-2025, 03:58:23 GMT

deisam, proceedings, reasoning, (14 more...)

Neural Information Processing Systems

Country:

Europe > Netherlands > North Brabant > Eindhoven (0.04)
Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

Add feedback

54dd9e0cff6d9214e20d97eb2a3bae49-Paper-Conference.pdf

Neural Information Processing SystemsOct-10-2025, 02:54:39 GMT

opération, protocol, reagent, (14 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
Europe > Spain > Aragón (0.04)
Europe > Italy > Lazio > Rome (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry:

Materials > Chemicals (1.00)
Law (1.00)
Energy (0.93)
Health & Medicine > Pharmaceuticals & Biotechnology (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.92)
(4 more...)

Add feedback

504fa7e518da9d1b53a233ed20a38b46-Paper-Conference.pdf

Neural Information Processing SystemsOct-10-2025, 02:24:23 GMT

Trained on vast corpora of human language, language models demonstrate emergent human-like reasoning abilities.

experiment, iteration, language model, (15 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
Asia > Indonesia > Bali (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(4 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Education (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

FedGTST: Boosting Global Transferability of Federated Models via Statistics Tuning

Neural Information Processing SystemsOct-10-2025, 00:51:45 GMT

In addition to boosting the performance of a target domain model, TL also reduces the computational cost of fine-tuning the target domain model.

dataset, federated learning, transferability, (16 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Virginia (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
(2 more...)

Add feedback

Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning

Li, Ang, Wang, Charles, Fu, Deqing, Yue, Kaiyu, Cai, Zikui, Zhu, Wang Bill, Liu, Ollie, Guo, Peng, Neiswanger, Willie, Huang, Furong, Goldstein, Tom, Goldblum, Micah

arXiv.org Artificial IntelligenceOct-10-2025

Humans often use visual aids, for example diagrams or sketches, when solving complex problems. Training multimodal models to do the same, known as Visual Chain of Thought (Visual CoT), is challenging due to: (1) poor off-the-shelf visual CoT performance, which hinders reinforcement learning, and (2) the lack of high-quality visual CoT training data. We introduce $\textbf{Zebra-CoT}$, a diverse large-scale dataset with 182,384 samples, containing logically coherent interleaved text-image reasoning traces. We focus on four categories of tasks where sketching or visual reasoning is especially natural, spanning scientific questions such as geometry, physics, and algorithms; 2D visual reasoning tasks like visual search and jigsaw puzzles; 3D reasoning tasks including 3D multi-hop inference, embodied and robot planning; visual logic problems and strategic games like chess. Fine-tuning the Anole-7B model on the Zebra-CoT training corpus results in an improvement of +12% in our test-set accuracy and yields up to +13% performance gain on standard VLM benchmark evaluations. Fine-tuning Bagel-7B yields a model that generates high-quality interleaved visual reasoning chains, underscoring Zebra-CoT's effectiveness for developing multimodal reasoning abilities. We open-source our dataset and models to support development and evaluation of visual CoT.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2507.16746

Country: North America > United States (1.00)

Genre:

Research Report (0.82)
Workflow (0.67)

Industry:

Health & Medicine > Therapeutic Area (0.69)
Leisure & Entertainment > Games (0.66)
Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

Add feedback

A Survey of Foundation Models for IoT: Taxonomy and Criteria-Based Analysis

Wei, Hui, Lee, Dong Yoon, Rohal, Shubham, Hu, Zhizhang, Rossi, Ryan, Fang, Shiwei, Pan, Shijia

arXiv.org Artificial IntelligenceOct-10-2025

Foundation models have gained growing interest in the IoT domain due to their reduced reliance on labeled data and strong generalizability across tasks, which address key limitations of traditional machine learning approaches. However, most existing foundation model based methods are developed for specific IoT tasks, making it difficult to compare approaches across IoT domains and limiting guidance for applying them to new tasks. This survey aims to bridge this gap by providing a comprehensive overview of current methodologies and organizing them around four shared performance objectives by different domains: efficiency, context-awareness, safety, and security & privacy. For each objective, we review representative works, summarize commonly-used techniques and evaluation metrics. This objective-centric organization enables meaningful cross-domain comparisons and offers practical insights for selecting and designing foundation model based solutions for new IoT tasks. We conclude with key directions for future research to guide both practitioners and researchers in advancing the use of foundation models in IoT applications.

arxiv preprint arxiv, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2506.12263

Country: North America > United States > California (0.46)

Genre: Overview (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Consumer Health (1.00)
Government (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
(3 more...)

Add feedback

SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models

Deng, Andong, Yang, Taojiannan, Yu, Shoubin, Spencer, Lincoln, Bansal, Mohit, Chen, Chen, Yeung-Levy, Serena, Wang, Xiaohan

arXiv.org Artificial IntelligenceOct-10-2025

Large Multimodal Models (LMMs) have achieved remarkable progress across various capabilities; however, complex video reasoning in the scientific domain remains a significant and challenging frontier. Current video benchmarks predominantly target general scenarios where perception/recognition is heavily relied on, while with relatively simple reasoning tasks, leading to saturation and thus failing to effectively evaluate advanced multimodal cognitive skills. To address this critical gap, we introduce SciVideoBench, a rigorous benchmark specifically designed to assess advanced video reasoning in scientific contexts. SciVideoBench consists of 1,000 carefully crafted multiple-choice questions derived from cutting-edge scientific experimental videos spanning over 25 specialized academic subjects and verified by a semi-automatic system. Each question demands sophisticated domain-specific knowledge, precise spatiotemporal perception, and intricate logical reasoning, effectively challenging models' higher-order cognitive abilities. Our evaluation highlights significant performance deficits in state-of-the-art proprietary and open-source LMMs, including Gemini 2.5 Pro and Qwen2.5-VL, indicating substantial room for advancement in video reasoning capabilities. Detailed analyses of critical factors such as reasoning complexity and visual grounding provide valuable insights and clear direction for future developments in LMMs, driving the evolution of truly capable multimodal AI co-scientists. We hope SciVideoBench could fit the interests of the community and help to push the boundary of cutting-edge AI for border science.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2510.08559

Genre: Research Report > New Finding (0.46)

Industry:

Education (1.00)
Materials > Chemicals (0.93)
Health & Medicine > Pharmaceuticals & Biotechnology (0.93)
Health & Medicine > Therapeutic Area > Neurology (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

Add feedback