AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

bb57db42f77807a9c5823bd8c2d9aaef-Supplemental.pdf

Neural Information Processing SystemsAug-17-2025, 02:58:06 GMT

inequality hold, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.49)

Add feedback

Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs

Neural Information Processing SystemsAug-17-2025, 02:58:02 GMT

More specifically, the discounted MDP is one of the standard MDPs in reinforcement learning to describe sequential tasks without interruption or restart.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

bb1443cc31d7396bf73e7858cea114e1-Paper.pdf

Neural Information Processing SystemsAug-17-2025, 02:48:14 GMT

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)

Add feedback

Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

Neural Information Processing SystemsAug-17-2025, 02:35:59 GMT

A long-standing goal in artificial intelligence and algorithmic game theory has been to develop a general algorithm which is capable of finding approximate Nash equilibria in large imperfect-information two-player zero-sum games.

approximate nash equilibrium, equilibrium, nash equilibrium, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Orange County > Irvine (0.15)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
(3 more...)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.99)

Add feedback

ba3c5fe1d6d6708b5bffaeb6942b7e04-Paper.pdf

Neural Information Processing SystemsAug-17-2025, 02:24:59 GMT

information, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country: Europe > Austria (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)

Add feedback

Reinforcement Learning in Newcomblike Problems

Caspar Oesterheld

Neural Information Processing SystemsAug-17-2025, 02:23:09 GMT

Newcomblike decision problems have been studied extensively in the decision theory literature, but they have so far been largely absent in the reinforcement learning literature.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)
Asia > Middle East > Syria > Damascus Governorate > Damascus (0.06)
Asia > Middle East > Syria > Aleppo Governorate > Aleppo (0.05)
(5 more...)

Industry:

Leisure & Entertainment > Games (0.93)
Transportation > Ground > Road (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

TorsionNet: A Reinforcement Learning Approach to Sequential Conformer Search

Neural Information Processing SystemsAug-17-2025, 02:17:35 GMT

Molecular geometry prediction of flexible molecules, or conformer search, is a longstanding challenge in computational chemistry.

conformer, molecule, torsionnet, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Puerto Rico > San Juan > San Juan (0.04)
(2 more...)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Energy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

e7532dbeff7ef901f2e70daacb3f452d-Paper.pdf

Neural Information Processing SystemsAug-17-2025, 01:42:04 GMT

machine learning, node, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > India (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(2 more...)

Genre: Research Report (0.46)

Industry: Information Technology (0.94)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.69)
Information Technology > Data Science > Data Mining (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.51)

Add feedback

We agree G COMB

Neural Information Processing SystemsAug-17-2025, 01:41:53 GMT

We are addressing only the major comments in this document. In this document, RXCY refers to Comment Y by Reviewer X. We will ensure to make this crystal clear. In contrast, [4] is an end-to-end reinforcement learning architecture and thus time-consuming. The slowness of CELF in IM is also reported in [2].

comb, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.37)

Add feedback