AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

News Overviews Instructional Materials AI-Alerts Classics

Semantic HELM: A Human-Readable Memory for Reinforcement Learning Fabian Paischer 1, Thomas Adler

Neural Information Processing SystemsFeb-8-2026, 17:46:02 GMT

In this regard, we propose a novel memory mechanism that represents past events in human language.

large language model, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland > Baltimore (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > Canada > British Columbia > Vancouver (0.04)
(21 more...)

Genre:

Research Report > New Finding (0.46)
Research Report > Experimental Study (0.46)

Industry:

Health & Medicine (1.00)
Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

48adb34f7ee39177c4c23a8e4253a492-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 17:45:17 GMT

adversarial example, agent, perturbation, (14 more...)

Neural Information Processing Systems

Country:

Asia > Taiwan (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.68)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.69)
(2 more...)

OntheEstimationBiasinDoubleQ-Learning

Neural Information Processing SystemsFeb-8-2026, 17:44:47 GMT

One of the phenomena of interest is that Q-learning (Watkins, 1989) is known to suffer from overestimation issues, since it takes a maximum operator overaset ofestimated action-values.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States > Illinois (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

6734fa703f6633ab896eecbdfad8953a-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 17:27:00 GMT

assumption 3, sample complexity, trajectory, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > Washington > King County > Seattle (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(3 more...)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.51)

ProvablyFeedback-EfficientReinforcementLearning viaActiveRewardLearning

Neural Information Processing SystemsFeb-8-2026, 17:14:21 GMT

Here H is the horizon oftheRL environment, anddimR specifies thecomplexity ofthefunction class representing the reward function.

machine learning, pmlr, reinforcement learning, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.49)

661b1e76b95cc50a7a11a85619a67d95-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 17:06:56 GMT

international conference, trajectory, world model, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(16 more...)

Genre: Research Report (0.46)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

661b1e76b95cc50a7a11a85619a67d95-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 17:06:49 GMT

international conference, trajectory, world model, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.05)
(16 more...)

Genre: Research Report (0.46)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

SecurityAnalysisofSafeandSeldonian ReinforcementLearningAlgorithms

Neural Information Processing SystemsFeb-8-2026, 17:04:40 GMT

This component makes current Seldonian algorithms safe: the safety test checks whether necessary safety constraints are satisfiedwithhighprobability.

machine learning, reinforcement learning, trajectory, (19 more...)

Neural Information Processing Systems

Country:

North America > United States (0.28)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry:

Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.96)
Government (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)

47561f5e1dc53c7d119185e217b523d0-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 16:57:07 GMT

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Industry:

Information Technology (0.46)
Energy (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Information Technology > Data Science (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.47)

19a42d5885e25e51852aca8144e5af0d-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 16:56:57 GMT

algorithm, fedlsa, heterogeneous, (15 more...)

Neural Information Processing Systems

Country:

Europe > Russia (0.04)
Asia > Russia (0.04)
North America > United States > Virginia (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.92)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)