AITopics | Problem Solving

Our findings on the NoRa dataset reveal a prevalent vulnerability to such noise among current LLMs, with existing robust methods like self-correction and self-consistency showing limited efficacy.

dataset, digit, rationale, (16 more...)

Neural Information Processing Systems

Country:

Europe > France (0.04)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
(5 more...)

Genre:

Workflow (1.00)
Research Report > New Finding (1.00)
Research Report > Experimental Study (0.92)

Industry:

Leisure & Entertainment > Games (1.00)
Health & Medicine > Therapeutic Area (0.67)
Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
(3 more...)

d81cb1f4dc6e13aeb45553f80b3d6837-Paper-Conference.pdf

Neural Information Processing Systems, Oct-10-2025, 18:16:48 GMT

calculation incorrect contextual logic link, evaluate model-generated step-by-step solution, relativistic kinetic energy formula, (13 more...)

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Austria > Vienna (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(13 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Workflow (0.73)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine (1.00)
Education > Curriculum > Subject-Specific Education (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Learning Goal-Conditioned Representations for Language Reward Models

Neural Information Processing Systems, Oct-10-2025, 17:44:24 GMT

Nevertheless, it is unclear how improved representation learning can benefit reinforcement learning from human feedback on language models.

goal state, representation, reward model, (15 more...)

Country:

North America > United States > Virginia (0.04)
North America > Canada (0.04)
Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Information Technology (1.00)
Leisure & Entertainment (0.93)
Banking & Finance > Trading (0.92)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.97)

A Neuro-Symbolic Benchmark Suite for Concept Quality and Reasoning Shortcuts

Neural Information Processing Systems, Oct-10-2025, 17:28:40 GMT

Recent evidence suggests that, in some problems, NeSy models can achieve high accuracy on the reasoning task by learning concepts with incorrect semantics.

architecture, dataset, kernel, (14 more...)

Country:

Europe > Austria > Vienna (0.14)
Asia > Sri Lanka > Central Province > Kandy District > Kandy (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(3 more...)

Genre:

Research Report (1.00)

Industry:

Law (1.00)
Government (1.00)
Information Technology (0.67)
Transportation > Ground > Road (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
(5 more...)

Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs

Neural Information Processing Systems, Oct-10-2025, 16:39:49 GMT

Limitations in either capability can impede the overall performance of a VLM. A systematic evaluation of the perception and reasoning capabilities is crucial to provide valuable insights for future model optimization.

arxiv preprint arxiv, instruction, vlm, (14 more...)

Country: