AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

News Overviews Instructional Materials AI-Alerts Classics

2051bd70fc110a2208bdbd4a743e7f79-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 18:34:30 GMT

In recent years, we have witnessed tremendous progress in deep reinforcement learning(RL)fortaskssuchasGo,Chess,videogames,androbotcontrol.

machine learning, reinforcement learning, rl agent, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
North America > United States (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Leisure & Entertainment > Games (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

1e747ddbea997a1b933aaf58a7953c3c-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 18:34:13 GMT

machine learning, natural language, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)

Genre: Overview (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
(2 more...)

1e4d36177d71bbb3558e43af9577d70e-Supplemental.pdf

Neural Information Processing SystemsFeb-7-2026, 18:25:36 GMT

autombpo, hyperparameter, policy training iteration, (13 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

OnEffectiveSchedulingofModel-based ReinforcementLearning

Neural Information Processing SystemsFeb-7-2026, 18:25:32 GMT

Model-based reinforcement learning has attracted wide attention due to its superior sample efficiency.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: Asia > China > Shanghai > Shanghai (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

LearningDynamicBeliefGraphstoGeneralize onText-BasedGames

Neural Information Processing SystemsFeb-7-2026, 18:14:20 GMT

GATAis trained using acombination of reinforcement and self-supervised learning. Our workdemonstrates thatthelearned graph-based representations helpagents converge to better policies than their text-only counterparts and facilitate effective generalization across game configurations.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > Quebec > Montreal (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

FederatedEnsemble-Directed OfflineReinforcementLearning

Neural Information Processing SystemsFeb-7-2026, 18:05:03 GMT

We consider the problem of federated offline reinforcement learning (RL), a scenario under which distributed learning agents must collaboratively learn a high-quality control policyonly using small pre-collected datasets generated according to different unknown behavior policies. Naïvely combining a standard offline RL approach with a standard federated learning approach to solve this problem can lead to poorly performing policies. In response, we develop the Federated Ensemble-Directed Offline Reinforcement Learning Algorithm (FEDORA), which distills the collective wisdom of the clients using an ensemble learning approach. We develop the FEDORA codebase to utilize distributed compute resources on a federated learning platform. We show that FEDORA significantly outperforms other approaches, including offline RL over the combined data pool, in various complex continuous control environments and realworld datasets.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > Virginia (0.04)

Industry: Energy (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

1cd73be1e256a7405516501e94e892ac-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 17:46:25 GMT

arxiv preprint arxiv, exploitability, oracle, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Orange County > Irvine (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

1cbcaa5abbb6b70f378a3a03d0c26386-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 17:44:41 GMT

cil task, learning, policy function, (17 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Saarland (0.04)
Europe > Germany > North Rhine-Westphalia > Cologne Region > Aachen (0.04)
Asia > Singapore (0.04)
Asia > China > Liaoning Province > Shenyang (0.04)

Genre: Research Report (0.46)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)

1a0755b249b772ed5529796b0a7cc9bd-Paper-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 17:44:10 GMT

dataset, learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Beijing > Beijing (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)