AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

217a2a387f52c30755c37b0a73430291-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsFeb-7-2026, 21:13:55 GMT

algorithm, dexterous manipulation, manipulation, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Beijing > Beijing (0.04)
North America > United States > Montana (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Manipulation (0.86)

Add feedback

2119b5ac365c30dfac17a840c2755c30-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 21:03:34 GMT

function approximation, neural network, rademacher complexity, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)

Genre:

Research Report > New Finding (0.46)
Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Data Science > Data Mining (0.93)

Add feedback

UnderstandingDeepNeuralFunctionApproximation inReinforcementLearningviaϵ-GreedyExploration

Neural Information Processing SystemsFeb-7-2026, 21:03:30 GMT

This problem setting is motivated by the successful deep Q-networks (DQN) framework that falls in this regime.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country: Europe > Switzerland (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.70)

Add feedback

274e6fcf4a583de4a81c6376f17673e7-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 20:56:07 GMT

agent, generalization, goal imagination, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > Canada (0.04)
Europe > France (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.68)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Add feedback

228bbc2f87caeb21bb7f6949fddcb91d-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 20:44:53 GMT

algorithm, neural information processing system, variance, (11 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)
Information Technology > Data Science > Data Mining > Big Data (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)

Add feedback

26588e932c7ccfa1df309280702fe1b5-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 20:43:59 GMT

agent, arxiv preprint arxiv, representation, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.15)
Asia > Japan > Honshū > Tōhoku > Iwate Prefecture > Morioka (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre:

Research Report (0.46)
Instructional Material (0.46)

Industry: Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

218344619d8fb95d504ccfa11804073f-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 20:05:26 GMT

agent, compositional task, generator, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > California > Santa Clara County > Mountain View (0.04)

Genre: Research Report (0.46)

Industry: Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.85)

Add feedback

212ab20dbdf4191cbcdcf015511783f4-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 19:45:44 GMT

assumption, international conference, reinforcement, (11 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Add feedback

Inferringlearningrulesfromanimaldecision-making

Neural Information Processing SystemsFeb-7-2026, 19:44:59 GMT

Our method efficiently infers the trial-to-trial changes inananimal'spolicy,and decomposes those changes into a learning component and a noise component.

machine learning, reinforcement learning, trajectory, (19 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)

Add feedback

Fractal Landscapes in Policy Optimization

Neural Information Processing SystemsFeb-7-2026, 19:43:34 GMT

The understanding of such failure cases is still limited. For instance, the training process of reinforcement learning is unstable and the learning curve can fluctuate during training in ways that are hard to predict. The probability of obtaining satisfactory policies can also be inherently low in reward-sparse or highly nonlinear control tasks.

machine learning, objective function, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Diego County > San Diego (0.04)
Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback