AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

Generalized Off-Policy Actor-Critic

Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson

Neural Information Processing SystemsFeb-11-2026, 11:15:17 GMT

Neural Information Processing Systems http://nips.cc/

geoff-p ac, objective, policy gradient, (9 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

b2b781badeeb49896c4b324c466ec442-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 10:48:14 GMT

cost function, expert demonstration, rhirl, (13 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.05)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

b2a1c152f14a4b842a9ddb3bd84c62a1-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsFeb-11-2026, 10:47:53 GMT

agent, algorithm, international conference, (11 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
Europe > Sweden > Skåne County > Malmö (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.68)

Industry:

Information Technology (1.00)
Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

da4fb5c6e93e74d3df8527599fa62642-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 10:47:45 GMT

algorithm, algorithm 1, ambiguity, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > New Hampshire (0.04)
North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
Asia > China > Hong Kong (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)

Add feedback

4aa8891583f07ae200ba07843954caeb-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsFeb-11-2026, 10:36:28 GMT

algorithm, implementation, mo-gymnasium, (17 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
South America > Brazil > Rio Grande do Sul (0.04)
North America > United States > Massachusetts (0.04)
(7 more...)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Robots (0.93)
(2 more...)

Add feedback

RobustInverseReinforcementLearningunder TransitionDynamicsMismatch

Neural Information Processing SystemsFeb-11-2026, 10:35:52 GMT

Leveraginginsights from theRobustRLliterature, wepropose arobustMCEIRLalgorithm, which is a principled approach to help with this mismatch. Finally, we empirically demonstrate the stable performance of our algorithm compared to the standard MCEIRL algorithm under transition dynamics mismatches in both finite and continuousMDPproblems.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.46)

Add feedback

d9e74f47610385b11e295eec4c58d473-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 10:35:49 GMT

algorithm, mismatch, transition dynamic mismatch, (8 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Robots (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

Language as an Abstraction for Hierarchical Deep Reinforcement Learning

YiDing Jiang, Shixiang (Shane) Gu, Kevin P. Murphy, Chelsea Finn

Neural Information Processing SystemsFeb-11-2026, 10:28:16 GMT

With the ability to learn concepts and sub-skills that can be composed to solve longer tasks, i.e. hierarchical RL, wecanacquire temporally-extended behaviors.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.98)

Add feedback