AITopics | path length

What makes math problems hard for reinforcement learning: a case study

Neural Information Processing SystemsJun-23-2026, 01:20:28 GMT

Using a long-standing conjecture from combinatorial group theory, we explore, from multiple perspectives, the challenges of finding rare instances carrying disproportionately high rewards. Based on lessons learned in the context defined by the Andrews-Curtis conjecture, we analyze how reinforcement learning agents handle problems of varying hardness. We also address many mathematical questions as a part of our study. Notably, we demonstrate the length reducibility of all but two presentations in the Akbulut-Kirby series (1981), and resolve various potential counterexamples in the Miller-Schupp series (1991), including three infinite subfamilies.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.46)

Genre: Research Report > Experimental Study (1.00)

Industry: Leisure & Entertainment > Games > Computer Games (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

SOAT: AScene-and Object-Aware Transformer for Vision-and-Language Navigation

Neural Information Processing SystemsApr-25-2026, 12:58:01 GMT

A.1 Limitations We propose an approach which exploits object features in addition to scene features for vision-andlanguage navigation (VLN). Our approach is able to utilize object features for better visiolinguistic alignment (see Section 5) despite the domain gap between the images used to train the object detector and VLN data. Specifically, object features are obtained using a Faster R-CNN detector [1] trained on photos from web (Visual Genome [2]), in which objects are typically well framed by the photographer. On the other hand, the VLN datasets used in our experiments contain panoramic images from indoor house scans that capture objects at viewing angles determined by the navigation path. The gap between these two types of data could be eliminated by either fine-tuning or training detector directly on indoor scenes.

agent, artificial intelligence, ascene-and object-aware transformer, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Results

Neural Information Processing SystemsApr-24-2026, 17:09:42 GMT

In this section we prove the theoretical results around the dual curriculum game and use these results to show approximation bounds for our methods, given that they have reached a Nash equilibrium (NE). The first theorem is the main result that allows us to analyze dual curriculum games. The high-level result says that the NE of a dual curriculum game are approximate NE of the base game from the perspective of any of the individual players, or from the perspective of the joint strategy. Let Bbe the maximum difference between U1t and U2t, and let (π,θ1,θ2) be a NE for G. Then (π,pθ1 + (1 p)θ2) is an approximate NE for the base game with either teacher or for a teacher optimizing their joint objective. More precisely, it is a 2Bp(1 p)-approximate NE when Ut = pU1t + (1 p)U2t, a 2B(1 p)-approximate NE when Ut = U1t, and a 2Bp-approximate NE when Ut = U2t. At a high level, this is true because, for low values of p, the best-response strategies for the individual players can be thought of as approximate-best response strategies for the joint-player, and vis-versa. Since the Nash Equilibrium consists of each of the players playing their own best response, they must be playing an approximate best response for the joint-player. We provide a formal proof below: Proof. Let B be the maximum difference between U1t and U2t, and let (π,θ1,θ2) be a Nash Equilibrium for G. Then consider pθ1 + (1 p)θ2 as a strategy in the base game for the joint player pU1t + (1 p)U2t.

agent, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Europe (1.00)
Asia > China (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment > Sports > Motorsports > Formula One (1.00)
Leisure & Entertainment > Games (0.74)

Technology:

Information Technology > Game Theory (0.90)
Information Technology > Artificial Intelligence > Machine Learning (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.48)

Add feedback

0a245311a23460d1846043d4156445d6-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 11:50:01 GMT

machine learning, natural language, node, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.29)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

37bc2f75bf1bcfe8450a1a41c200364c-Paper.pdf

Neural Information Processing SystemsMar-23-2026, 06:20:22 GMT

artificial intelligence, machine learning, residual network, (19 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Unconstrained Dynamic Regret via Sparse Coding

Neural Information Processing SystemsFeb-17-2026, 19:21:24 GMT

Nonstationarity is prevalent in sequential decision making, which poses a critical challenge to the vast majority of existing approaches developed offline.

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Germany (0.04)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.94)

Add feedback

GameTraversalBenchmark: Evaluating Planning Abilities Of Large Language Models Through Traversing 2D Game Maps

Neural Information Processing SystemsFeb-11-2026, 05:33:03 GMT

Large language models (LLMs) have recently demonstrated great success in generating and understanding natural language. While they have also shown potential beyond the domain of natural language, it remains an open question as to what extent and in which way these LLMs can plan.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
Africa > South Africa > Gauteng > Johannesburg (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Add feedback

62da5a6d47be0029801ba74a17e47e1a-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 15:55:45 GMT

algorithm, prediction, probe, (16 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > California > Santa Clara County > Mountain View (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
(5 more...)

Industry: Transportation (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

0a245311a23460d1846043d4156445d6-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 09:34:23 GMT

graph, node, trajectory, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.29)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Learning under Distributional Drift: Reproducibility as an Intrinsic Statistical Resource

Zaichyk, Sofiya

arXiv.org Machine LearningDec-16-2025

Statistical learning under distributional drift remains insufficiently characterized: when each observation alters the data-generating law, classical generalization bounds can collapse. We introduce a new statistical primitive, the reproducibility budget $C_T$, which quantifies a system's finite capacity for statistical reproducibility - the extent to which its sampling process can remain governed by a consistent underlying distribution in the presence of both exogenous change and endogenous feedback. Formally, $C_T$ is defined as the cumulative Fisher-Rao path length of the coupled learner-environment evolution, measuring the total distributional motion accumulated during learning. From this construct we derive a drift-feedback generalization bound of order $O(T^{-1/2} + C_T/T)$, and we prove a matching minimax lower bound showing that this rate is minimax-optimal. Consequently, the results establish a reproducibility speed limit: no algorithm can achieve smaller worst-case generalization error than that imposed by the average Fisher-Rao drift rate $C_T/T$ of the data-generating process. The framework situates exogenous drift, adaptive data analysis, and performative prediction within a common geometric structure, with $C_T$ emerging as the intrinsic quantity measuring distributional motion across these settings.

learner, learning, trajectory, (14 more...)

arXiv.org Machine Learning

2512.13506

Country:

North America > United States > New Jersey > Mercer County > Princeton (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
(7 more...)

Genre: Research Report (1.00)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Filters

Collaborating Authors

path length

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

What makes math problems hard for reinforcement learning: a case study

SOAT: AScene-and Object-Aware Transformer for Vision-and-Language Navigation

Results

0a245311a23460d1846043d4156445d6-Supplemental-Conference.pdf

37bc2f75bf1bcfe8450a1a41c200364c-Paper.pdf

Unconstrained Dynamic Regret via Sparse Coding

GameTraversalBenchmark: Evaluating Planning Abilities Of Large Language Models Through Traversing 2D Game Maps

62da5a6d47be0029801ba74a17e47e1a-Paper.pdf

0a245311a23460d1846043d4156445d6-Supplemental-Conference.pdf

Learning under Distributional Drift: Reproducibility as an Intrinsic Statistical Resource