AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

XDO: ADoubleOracleAlgorithmfor Extensive-FormGames

Neural Information Processing SystemsFeb-11-2026, 01:02:49 GMT

Policy Space Response Oracles (PSRO) is a reinforcement learning (RL) algorithm for two-player zero-sum games that has been empirically shown to find approximate Nash equilibria in large games.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

c21f4ce780c5c9d774f79841b81fdc6d-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 00:46:23 GMT

linear transition model, q-learning, sample complexity, (11 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

ee23e7ad9b473ad072d57aaa9b2a5222-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 00:36:32 GMT

morphology, neural network, timestep, (13 more...)

Neural Information Processing Systems

Country:

Europe > Denmark > Capital Region > Copenhagen (0.04)
North America > United States (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre:

Research Report (0.47)
Overview (0.46)
Instructional Material (0.34)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.94)

Add feedback

9c7008aff45b5d8f0973b23e1a22ada0-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 00:28:13 GMT

arxiv preprint arxiv, dataset, foundation model, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (0.49)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

c1b8bf9e071c0dabb899e7a27f353762-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 00:27:01 GMT

algorithm, assumption, international conference, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.14)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)
Information Technology > Data Science > Data Mining > Big Data (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.35)

Add feedback

Near-OptimalRegretBoundsforMulti-batch ReinforcementLearning

Neural Information Processing SystemsFeb-11-2026, 00:18:08 GMT

Meanwhile, we show that to achieve OppolypS,A,Hq?

machine learning, nmh ps, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.04)
Europe > United Kingdom > England (0.04)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Data Science (0.93)

Add feedback

9bcd1fa0c05e5f25ba7a1261f1852e82-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 00:18:05 GMT

algorithm, log 2, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

TowardsTrustworthyAutomaticDiagnosisSystemsby EmulatingDoctors'ReasoningwithDeep ReinforcementLearning

Neural Information Processing SystemsFeb-11-2026, 00:05:48 GMT

Moreover,doctors explicitly explore severepathologies before potentially ruling them out from the differential, especially in acute care settings. Finally, for doctors to trust a system's recommendations, they need to understand how the gathered evidences led to the predicted diseases.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country: