AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

1a669e81c8093745261889539694be7f-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 16:04:07 GMT

Inthesecondexample, vacuuming the floors of a house has certain risks, but the consequences of optimizing the wrong rewardfunction arearguably muchlesssignificant.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.97)
Information Technology > Artificial Intelligence > Robots (0.70)

Add feedback

19aa6c6fb4ba9fcf39e893ff1fd5b5bd-Supplemental.pdf

Neural Information Processing SystemsFeb-7-2026, 15:55:44 GMT

learner, reward function, trajectory, (15 more...)

Neural Information Processing Systems

Country:

Europe > Italy > Lombardy > Milan (0.04)
North America > United States > Massachusetts (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(7 more...)

Industry: Transportation (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

19aa6c6fb4ba9fcf39e893ff1fd5b5bd-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 15:55:36 GMT

algorithm, learner, reward function, (14 more...)

Neural Information Processing Systems

Country:

Europe > Italy > Lombardy > Milan (0.04)
North America > United States > Massachusetts (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(7 more...)

Industry: Transportation (0.69)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

19aa6c6fb4ba9fcf39e893ff1fd5b5bd-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-7-2026, 15:55:25 GMT

experiment, learner, reward function, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Add feedback

main_final

Neural Information Processing SystemsFeb-7-2026, 15:54:44 GMT

arxiv preprint arxiv, disturbance attenuation problem, matrix, (11 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Illinois (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.46)

Industry: Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)

Add feedback

1663fba7b56da1e96bed6e30546a07b0-Paper-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 15:35:14 GMT

machine learning, natural language, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.04)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(5 more...)

Add feedback

FindingCounterfactuallyOptimalActionSequences inContinuousStateSpaces

Neural Information Processing SystemsFeb-7-2026, 15:26:04 GMT

However, in many practical applications, the state of the environment is inherently continuous innature.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Rhineland-Palatinate > Kaiserslautern (0.04)
North America > United States > Virginia > Arlington County > Arlington (0.04)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)

Add feedback

165a59f7cf3b5c4396ba65953d679f17-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 15:16:31 GMT

agent, learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > South Korea > Seoul > Seoul (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

Gradient Regularized V-Learning for Dynamic Treatment Regimes

Neural Information Processing SystemsFeb-7-2026, 15:06:06 GMT

Treatment individualization and adaptation over time are crucial for managing chronic diseases.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Indiana > Marion County > Indianapolis (0.04)
North America > Canada (0.04)

Genre: Research Report (0.69)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.93)
Health & Medicine > Therapeutic Area > Oncology (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Add feedback

Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples Hao Sun

Neural Information Processing SystemsFeb-7-2026, 15:05:48 GMT

We assess AOC's performance in both simulated and real-world healthcare scenarios, emphasizing its capability to manage offline control tasks with high

artificial intelligence, machine learning, reinforcement learning, (11 more...)

Neural Information Processing Systems

Country: