AITopics | Reinforcement Learning

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

0cb929eae7a499e50248a3a78f7acfc7-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 11:22:52 GMT

algorithm, reward function, sample complexity, (12 more...)

Neural Information Processing Systems

Country: North America > United States > California > Los Angeles County > Los Angeles (0.29)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.42)

A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

Neural Information Processing Systems, Feb-7-2026, 11:22:19 GMT

Whether planning our paths home from the office or from a hotel to an airport in an unfamiliar city, we typically focus on a small subset of relevant variables, e.g. the change in position or the presence of traffic.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Conservative Q-Learning for Offline Reinforcement Learning (Aviral Kumar)

Neural Information Processing Systems, Feb-7-2026, 11:14:43 GMT

Effectively leveraging large, previously collected datasets in reinforcement learning (RL) is a key challenge for large-scale real-world applications. Offline RL algorithms promise to learn effective policies from previously-collected, static datasets without further interaction.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: North America > Canada (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity

Neural Information Processing Systems, Feb-7-2026, 11:13:29 GMT

Model-based reinforcement learning (RL), which finds an optimal policy using an empirical model, has long been recognized as one of the cornerstones of RL.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.92)

0b9e57c46de934cee33b0e8d1839bfc2-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 11:05:10 GMT

algorithm, joint distribution, return distribution, (14 more...)

Neural Information Processing Systems

Country: Asia > China > Hong Kong (0.04)

Industry: Leisure & Entertainment > Games > Computer Games (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)

06cbd2e81dfbd3bb4cb0abce95b32584-Paper-Conference.pdf

Neural Information Processing Systems, Feb-7-2026, 11:03:51 GMT

elimination order, machine learning, programming language, (23 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois (0.04)
North America > United States > Arkansas > Cross County (0.04)
(3 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Banking & Finance (0.67)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

0cddb777d3441326544e21b67f41bdc8-Paper-Conference.pdf

Neural Information Processing Systems, Feb-7-2026, 10:53:17 GMT

solution symmetricity, sym-nco, symmetricity, (13 more...)

Neural Information Processing Systems

Country: Asia > Thailand > Bangkok > Bangkok (0.04)

Industry: Transportation (0.49)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.89)

0b96d81f0494fde5428c7aea243c9157-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 10:45:46 GMT

algorithm, prediction, update rule, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Europe > Germany > North Rhine-Westphalia > Upper Bavaria > Munich (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Feint Behaviors and Strategies: Formalization, Implementation and Evaluation

Neural Information Processing Systems, Feb-7-2026, 10:43:46 GMT

Feint behaviors are a set of nuanced deceptive behaviors that enable players to obtain temporal and spatial advantages over opponents in competitive games.

machine learning, natural language, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country: