AITopics | Reinforcement Learning

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.
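
The trial-and-error idea in the quotation can be illustrated with a small sketch. The example below is not taken from Sutton and Barto's text; it assumes a hypothetical three-armed bandit with Gaussian rewards and uses epsilon-greedy selection, one common way to balance trying actions against repeating the best one found so far. The function name `run_bandit`, the reward means, and all parameter values are illustrative assumptions.

```python
import random


def run_bandit(true_means, steps=3000, epsilon=0.1, seed=0):
    """Epsilon-greedy learning on a hypothetical multi-armed bandit.

    The learner is never told which action is best; it only observes
    the noisy reward of the action it tried.
    """
    rng = random.Random(seed)
    n_actions = len(true_means)
    estimates = [0.0] * n_actions  # running mean reward per action
    counts = [0] * n_actions       # number of times each action was tried
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a uniformly random action.
            action = rng.randrange(n_actions)
        else:
            # Exploit: pick the action with the highest current estimate.
            action = max(range(n_actions), key=lambda a: estimates[a])

        # Assumed reward model: Gaussian noise around a hidden mean.
        reward = rng.gauss(true_means[action], 1.0)

        # Incremental update of the sample mean for the chosen action.
        counts[action] += 1
        estimates[action] += (reward - estimates[action]) / counts[action]
        total_reward += reward

    return estimates, counts, total_reward


estimates, counts, total_reward = run_bandit([0.1, 0.5, 2.0])
print("tries per action:", counts)
print("estimated rewards:", [round(e, 2) for e in estimates])
```

Under these assumptions, most tries should end up on the action with the highest hidden mean, while the small exploration rate keeps the other estimates from going stale.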

6d70cb65d15211726dcce4c0e971e21c-Paper.pdf

Neural Information Processing Systems, Feb-8-2026, 19:44:19 GMT

international conference, iteration, solver, (16 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > Austria > Vienna (0.14)
North America > United States > California > Los Angeles County > Long Beach (0.14)
(17 more...)

Genre: Research Report (0.47)

Industry: Information Technology (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

4cca5640267b416cef4f00630aef93a2-Supplemental-Conference.pdf

Neural Information Processing Systems, Feb-8-2026, 19:31:53 GMT

algorithm, markov game, probability 1, (12 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

59112692262234e3fad47fa8eabf03a4-Paper.pdf

Neural Information Processing Systems, Feb-8-2026, 19:31:38 GMT

However, extrinsic rewards may be insufficiently informative to encourage an agent to explore and understand its environment, particularly in partially observed settings where the agent has a limited view of its environment.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > New Jersey > Middlesex County > Piscataway (0.04)
North America > United States > Massachusetts (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)

4beaed6a33716fcfe7b5250d10520eb9-Paper-Conference.pdf

Neural Information Processing Systems, Feb-8-2026, 19:15:21 GMT

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > California > Los Angeles County > Pomona (0.04)
(3 more...)

Genre: Research Report (0.46)

Industry:

Leisure & Entertainment > Games (1.00)
Banking & Finance (0.94)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

A Boolean Task Algebra For Reinforcement Learning

Neural Information Processing Systems, Feb-8-2026, 19:02:12 GMT

A major challenge is thus in designing sample-efficient agents that can transfer their existing knowledge to solve new tasks quickly.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
North America > Canada (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)
Africa > South Africa > Gauteng > Johannesburg (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.85)

Local Differential Privacy for Regret Minimization in Reinforcement Learning

Neural Information Processing Systems, Feb-8-2026, 18:44:23 GMT

We formulate this notion of privacy for RL by leveraging the local differential privacy (LDP) framework.

machine learning, mechanism, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > France > Île-de-France > Paris > Paris (0.04)
North America > United States > Virginia > Arlington County > Arlington (0.04)
Asia > Middle East > Jordan (0.04)

Industry:

Information Technology > Security & Privacy (0.93)
Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Security & Privacy (0.93)

580760fb5def6e2ca8eaf601236d5b08-Paper.pdf

Neural Information Processing Systems, Feb-8-2026, 18:44:19 GMT

algorithm, information, privacy, (13 more...)

Neural Information Processing Systems

Country:

Europe > France > Île-de-France > Paris > Paris (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Virginia > Arlington County > Arlington (0.04)
(3 more...)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (0.68)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

4b32c2943a02331792877cc6b5205f49-Paper-Conference.pdf

Neural Information Processing Systems, Feb-8-2026, 18:43:07 GMT

Deep learning algorithms have shown significant development thanks to the large pre-collected dataset, such as SQuAD [47] in natural language processing (NLP), and ImageNet [4] in computer vision.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country: