AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

e0cfde0ff720fa9674bb976e7f1b99d4-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 10:12:26 GMT

machine learning, natural language, reinforcement learning, (11 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.94)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

Bootstrapped Transformer for Offline Reinforcement Learning Kerong Wang Shanghai Jiao Tong University Hanye Zhao Shanghai Jiao Tong University Xufang Luo Microsoft Research Asia Kan Ren

Neural Information Processing SystemsFeb-12-2026, 10:12:07 GMT

The work was conducted during the internship of Kerong Wang and Hanye Zhao at Microsoft Research.

machine learning, reinforcement learning, trajectory, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Shanghai > Shanghai (0.76)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Scalable Coordinated Exploration in Concurrent Reinforcement Learning

Maria Dimakopoulou, Ian Osband, Benjamin Van Roy

Neural Information Processing SystemsFeb-12-2026, 09:50:38 GMT

Neural Information Processing Systems http://nips.cc/

agent, algorithm, reinforcement, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > Canada > Quebec > Montreal (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards

Alexander Trott, Stephan Zheng, Caiming Xiong, Richard Socher

Neural Information Processing SystemsFeb-12-2026, 09:42:33 GMT

Neural Information Processing Systems http://nips.cc/

agent, local optima, sparse reward, (15 more...)

Neural Information Processing Systems

Country:

Europe > Sweden > Skåne County > Malmö (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Leisure & Entertainment (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Add feedback

Learning to Share and Hide Intentions using Information Regularization

DJ Strouse, Max Kleiman-Weiner, Josh Tenenbaum, Matt Botvinick, David J. Schwab

Neural Information Processing SystemsFeb-12-2026, 09:40:43 GMT

Neural Information Processing Systems http://nips.cc/

agent, alice, information, (12 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Robots (0.68)
(2 more...)

Add feedback

5d4cd12ef6efedbf26b69b410f1f7d67-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 09:32:55 GMT

constraint, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kansai > Osaka Prefecture > Osaka (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Add feedback

ContrastiveIntrinsicControlforUnsupervised ReinforcementLearning

Neural Information Processing SystemsFeb-12-2026, 09:03:40 GMT

Unlikeknowledge-based anddata-basedalgorithms, competence-based algorithms simultaneously address both the exploration challenge as well as distilling the generated experience in the form of reusable skills.

intrinsic reward, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.04)
Europe > Italy > Sardinia (0.04)
Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

EffectsofSafetyStateAugmentationon SafeExploration

Neural Information Processing SystemsFeb-12-2026, 09:02:22 GMT

There are still, however, some unsolved challenges for a successful deployment of RL such as efficient learning of constrained or safe Markov Decision Processes (MDPs) [4].

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country: Africa > Togo (0.04)

Genre: Research Report (0.46)

Technology: