AITopics | Reinforcement Learning

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.
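
The trial-and-error idea in the quotation can be illustrated with a tiny multi-armed bandit. This is a minimal sketch, not taken from the book: the function name `run_epsilon_greedy`, the fixed per-action rewards, and the epsilon-greedy exploration rule are all assumptions chosen for illustration. The learner is never told which action is best; it only sees the reward of the action it tries.

```python
import random
from typing import List, Sequence


def run_epsilon_greedy(
    true_rewards: Sequence[float],
    steps: int = 1000,
    epsilon: float = 0.1,
    seed: int = 0,
) -> List[float]:
    """Estimate the reward of each action purely by trying actions."""
    rng = random.Random(seed)
    n = len(true_rewards)
    estimates = [0.0] * n  # running mean reward per action
    counts = [0] * n       # how often each action was tried

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick a random action.
            action = rng.randrange(n)
        else:
            # Exploit: pick the action with the best current estimate.
            action = max(range(n), key=lambda a: estimates[a])

        # Hypothetical environment: each action pays a fixed reward.
        reward = true_rewards[action]

        # Incremental update of the running mean for the chosen action.
        counts[action] += 1
        estimates[action] += (reward - estimates[action]) / counts[action]

    return estimates


estimates = run_epsilon_greedy([0.2, 0.8, 0.5])
best_action = max(range(len(estimates)), key=lambda a: estimates[a])
print(best_action, estimates)
```

With greedy choice alone the learner would stay on the first action it tried; the occasional random action is what lets it discover the better one.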

0a113ef6b61820daa5611c870ed8d5ee-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 10:34:42 GMT

overestimation bias, softmax operator, value estimate, (11 more...)

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

0bf54b80686d2c4dc0808c2e98d430f7-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing Systems, Feb-7-2026, 10:33:34 GMT

Products that let you borrow, save, invest, trade, and more; and 3). "FinRL is the first open-source framework for financial reinforcement learning.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Country: North America > United States > New York (0.04)

Industry:

Banking & Finance > Trading (1.00)
Education > Educational Setting > Online (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.96)

09dbc1177211571ef3e1ca961cc39363-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 10:25:31 GMT

learner, suboptimality, trajectory, (17 more...)

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > California > Alameda County > Berkeley (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(4 more...)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.41)

0bc795afae289ed465a65a3b4b1f4eb7-Supplemental-Conference.pdf

Neural Information Processing Systems, Feb-7-2026, 10:25:09 GMT

gausssine, int, learning, (13 more...)

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
Education (0.93)
Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (0.68)
(2 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

061d5d1b7d97117764f205d4e038f9eb-Paper-Conference.pdf

Neural Information Processing Systems, Feb-7-2026, 10:23:37 GMT

arxiv preprint arxiv, concentration strength, experiment, (9 more...)

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > Italy > Tuscany > Florence (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Communications (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
(2 more...)

Offline Reinforcement Learning as One Big Sequence Modeling Problem

Neural Information Processing Systems, Feb-7-2026, 10:15:10 GMT

Reinforcement learning (RL) is typically concerned with estimating stationary policies or single-step models, leveraging the Markov property to factorize problems in time. However, we can also view RL as a generic sequence modeling problem, with the goal being to produce a sequence of actions that leads to a sequence of high rewards.
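
One way to picture the sequence-modeling view is to flatten a trajectory of states, actions, and rewards into a single stream of discrete tokens that a generic sequence model could consume. The sketch below is an illustration only: the helper names `discretize` and `flatten_trajectory`, the uniform binning, and the value range are assumptions, not details given in the abstract.

```python
from typing import List, Sequence, Tuple

# One time step: (state vector, action vector, scalar reward).
Step = Tuple[Sequence[float], Sequence[float], float]


def discretize(x: float, low: float, high: float, bins: int) -> int:
    """Map a scalar in [low, high] to a bin index in [0, bins - 1]."""
    x = min(max(x, low), high)  # clip to the assumed range
    idx = int((x - low) / (high - low) * bins)
    return min(idx, bins - 1)   # the upper edge falls in the last bin


def flatten_trajectory(
    steps: Sequence[Step],
    low: float = -1.0,
    high: float = 1.0,
    bins: int = 10,
) -> List[int]:
    """Interleave state, action, and reward tokens step by step."""
    tokens: List[int] = []
    for state, action, reward in steps:
        for value in (*state, *action, reward):
            tokens.append(discretize(value, low, high, bins))
    return tokens


# Hypothetical two-step trajectory with 2-D states and 1-D actions.
trajectory: List[Step] = [
    ([0.0, 0.5], [1.0], 0.0),
    ([0.25, -1.0], [-0.5], 1.0),
]
tokens = flatten_trajectory(trajectory)
print(tokens)  # [5, 7, 9, 5, 6, 0, 2, 9]
```

Once the trajectory is a token sequence, predicting the next token covers dynamics, policy, and reward prediction with a single model, which is the sense in which the time-factorized Markov structure is no longer required.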

machine learning, natural language, reinforcement learning, (19 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning

James Queeney

Neural Information Processing Systems, Feb-7-2026, 10:14:04 GMT

Many real-world domains require safe decision making in uncertain environments. In this work, we introduce a deep reinforcement learning framework for approaching this important problem. We consider a distribution over transition models, and apply a risk-averse perspective towards model uncertainty through the use of coherent distortion risk measures.
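
To make the idea of a distortion risk measure over a distribution of models concrete, here is a minimal sketch under stated assumptions. It takes a finite, equally weighted set of candidate transition models, each summarized by a hypothetical expected return, and reweights them so that the worst outcomes count more. The function names and the choice of CVaR as the distortion are illustrative assumptions; the abstract does not say which quantities the paper applies the risk measure to or which distortion functions it uses.

```python
from typing import Callable, Sequence


def cvar_distortion(alpha: float) -> Callable[[float], float]:
    """Concave distortion g(u) = min(u / alpha, 1), which yields CVaR."""
    def g(u: float) -> float:
        return min(u / alpha, 1.0)
    return g


def distortion_risk(
    values: Sequence[float],
    g: Callable[[float], float],
) -> float:
    """Distorted expectation of equally likely returns (higher is better).

    Values are sorted worst-first, and the i-th worst value receives
    weight g(i / n) - g((i - 1) / n). A concave g shifts weight toward
    the worst outcomes, which is the risk-averse direction.
    """
    ordered = sorted(values)
    n = len(ordered)
    total = 0.0
    for i, value in enumerate(ordered, start=1):
        weight = g(i / n) - g((i - 1) / n)
        total += weight * value
    return total


# Hypothetical expected returns under four sampled transition models.
model_returns = [10.0, 4.0, 8.0, 2.0]

risk_neutral = distortion_risk(model_returns, lambda u: u)
risk_averse = distortion_risk(model_returns, cvar_distortion(0.5))
print(risk_neutral, risk_averse)  # 6.0 3.0
```

The identity distortion recovers the plain average over models, while the CVaR distortion with `alpha = 0.5` averages only the worse half, so the risk-averse value is never larger than the risk-neutral one.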

machine learning, model uncertainty, reinforcement learning, (15 more...)

Country: North America > United States (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
