AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

8d6b1d775014eff18256abeb207202ad-Paper-Conference.pdf

Neural Information Processing SystemsAug-16-2025, 22:20:46 GMT

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Maryland > Prince George's County > College Park (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report (0.67)

Industry:

Information Technology > Security & Privacy (0.95)
Government > Military (0.95)
Leisure & Entertainment > Games > Computer Games (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Security & Privacy (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

b0b79da57b95837f14be95aaa4d54cf8-Paper.pdf

Neural Information Processing SystemsAug-16-2025, 21:40:39 GMT

data mining, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Health & Medicine (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Data Science > Data Mining (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)

Add feedback

EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine Jiayi Weng Min Lin

Neural Information Processing SystemsAug-16-2025, 21:20:43 GMT

Deep Reinforcement Learning (RL) has made remarkable progress in the past years.

envpool, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Efficient Planning in Large MDPs with Weak Linear Function Approximation

Roshan Shariff & Csaba Szepesvári

Neural Information Processing SystemsAug-16-2025, 20:59:03 GMT

In this paper we consider the intersection of these two problem formulations.

artificial intelligence, machine learning, reinforcement learning, (21 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Asia > Middle East > Jordan (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.41)

Add feedback

Accelerating Quadratic Optimization with Reinforcement Learning

Neural Information Processing SystemsAug-16-2025, 20:41:31 GMT

Solving quadratic programs (QPs) efficiently is critical to applications in finance, robotic control and operations research.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(3 more...)

Add feedback

policy improvement $

Neural Information Processing SystemsAug-16-2025, 20:21:28 GMT

Setting up a well-designed reward function has been challenging for many reinforcement learning applications.

machine learning, q-function, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Information Technology (0.68)
Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.95)

Add feedback

Avoiding Side Effects By Considering Future Tasks Victoria Krakovna

Neural Information Processing SystemsAug-16-2025, 20:01:55 GMT

Designing reward functions for a reinforcement learning agent is often a difficult task.

agent, auxiliary reward, side effect, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

Neural Information Processing SystemsAug-16-2025, 19:21:37 GMT

With increasing success in reinforcement learning (RL), there is broad interest in applying these methods to real-world settings. This has brought exciting progress in offline RL and off-policy policy evaluation (OPPE).

behavior policy, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country: