What makes math problems hard for reinforcement learning: a case study

Jun-23-2026, 01:20:28 GMT–Neural Information Processing Systems

Using a long-standing conjecture from combinatorial group theory, we explore, from multiple perspectives, the challenges of finding rare instances carrying disproportionately high rewards. Based on lessons learned in the context defined by the Andrews-Curtis conjecture, we analyze how reinforcement learning agents handle problems of varying hardness. We also address many mathematical questions as a part of our study. Notably, we demonstrate the length reducibility of all but two presentations in the Akbulut-Kirby series (1981), and resolve various potential counterexamples in the Miller-Schupp series (1991), including three infinite subfamilies.
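For readers unfamiliar with the setting, the following is a minimal sketch of the standard Andrews-Curtis moves on a balanced presentation, together with the total relator length that "length reducibility" refers to. It is not taken from the paper: the integer encoding of words and the function names (`free_reduce`, `ac_multiply`, `ac_invert`, `ac_conjugate`, `total_length`) are hypothetical choices made for illustration. "Length reducible" is read here as "some sequence of moves reaches a presentation of smaller total length", and the paper's precise definition may differ. The Akbulut-Kirby presentation is written in its commonly cited form AK(n) = <x, y | x^n = y^(n+1), xyx = yxy>.

```python
# Hypothetical encoding (an assumption, not the paper's): a word is a list of
# nonzero integers, where k is generator k and -k is its inverse.
# With two generators: 1 = x, -1 = x^-1, 2 = y, -2 = y^-1.


def free_reduce(word):
    """Cancel adjacent inverse pairs (such as x x^-1) until none remain."""
    out = []
    for letter in word:
        if out and out[-1] == -letter:
            out.pop()
        else:
            out.append(letter)
    return out


def invert(word):
    """Return the inverse word: reverse the order and invert each letter."""
    return [-letter for letter in reversed(word)]


def ac_multiply(relators, i, j):
    """AC move: replace r_i with r_i r_j, for i != j."""
    if i == j:
        raise ValueError("i and j must differ")
    new = [list(r) for r in relators]
    new[i] = free_reduce(relators[i] + relators[j])
    return new


def ac_invert(relators, i):
    """AC move: replace r_i with its inverse."""
    new = [list(r) for r in relators]
    new[i] = invert(relators[i])
    return new


def ac_conjugate(relators, i, g):
    """AC move: replace r_i with g r_i g^-1, where g is a single letter."""
    new = [list(r) for r in relators]
    new[i] = free_reduce([g] + relators[i] + [-g])
    return new


def total_length(relators):
    """Total length of a presentation: the sum of its relator lengths."""
    return sum(len(r) for r in relators)


# AK(2) in the form above: x^2 y^-3 and x y x y^-1 x^-1 y^-1.
ak2 = [[1, 1, -2, -2, -2], [1, 2, 1, -2, -1, -2]]

# A toy presentation <x, y | xy, y^-1> of the trivial group.
toy = [[1, 2], [-2]]
toy_reduced = ac_multiply(toy, 0, 1)  # xy * y^-1 freely reduces to x
```

In the toy case a single multiplication move lowers the total length from 3 to 2. Finding such length-reducing move sequences for longer presentations is the search problem that a reinforcement learning agent would face in this setting.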

artificial intelligence, machine learning, reinforcement learning

Country:
- North America > United States > California (0.46)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Leisure & Entertainment > Games > Computer Games (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
