AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

ProximalPointImitationLearning

Neural Information Processing SystemsFeb-10-2026, 23:25:53 GMT

Toour knowledge, such guarantees in this setting are providedforthefirsttime.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Switzerland > Vaud > Lausanne (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Add feedback

TheSensoryNeuronasaTransformer: Permutation-InvariantNeuralNetworksfor ReinforcementLearning

Neural Information Processing SystemsFeb-10-2026, 23:13:46 GMT

In complex systems, we often observe complex global behavior emerge from a collection of agents interacting with each other in their environment, with each individual agent acting only on locally available information, without knowing thefullpicture.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.83)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.54)

Add feedback

be26abe76fb5c8a4921cf9d3e865b454-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 23:03:31 GMT

explanatory subgraph, rg-explainer, training epoch, (11 more...)

Neural Information Processing Systems

Country:

Oceania > Australia > New South Wales > Sydney (0.05)
Asia > China > Hong Kong (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.41)

Add feedback

be26abe76fb5c8a4921cf9d3e865b454-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 23:03:27 GMT

artificial intelligence, machine learning, reinforcement learning, (20 more...)

Neural Information Processing Systems

Country: Asia > China > Hong Kong (0.04)

Genre: Overview (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

e9bcd1b063077573285ae1a41025f5dc-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 22:48:14 GMT

P2SROisabletoparallelize PSROwith convergence guarantees bymaintaining ahierarchical pipeline ofreinforcement learning workers, each training against the policies generated by lower levels in the hierarchy.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country: