AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

News Overviews Instructional Materials AI-Alerts Classics

a613863f6a3ada47ae5bca2a558872d1-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 11:46:38 GMT

demonstration, supervision, weak supervision, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > Middle East > Jordan (0.04)

Industry:

Education (0.93)
Leisure & Entertainment > Games (0.68)
Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

8636419dea1aa9fbd25fc4248e702da4-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 11:22:01 GMT

program synthesis, synthesis, unit test, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
North America > Puerto Rico > San Juan > San Juan (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

OnReward-FreeReinforcementLearningwith LinearFunctionApproximation

Neural Information Processing SystemsFeb-10-2026, 11:12:37 GMT

During the exploration phase, an agent collects samples without using a pre-specified reward function. After the exploration phase, a reward function is given, and the agent uses samples collected during the exploration phase to computeanear-optimalpolicy.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)

Provably Efficient Multi-Task Reinforcement Learning with Model Transfer

Neural Information Processing SystemsFeb-10-2026, 10:58:16 GMT

We study multi-task reinforcement learning (RL) in tabular episodic Markov decision processes (MDPs).

machine learning, reinforcement learning, state-action pair, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Arizona (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

cceff8faa855336ad53b3325914caea2-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 10:14:19 GMT

actor, learning, transition, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Europe > United Kingdom (0.04)
Asia > China (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

a35fe7f7fe8217b4369a0af4244d1fca-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 10:13:30 GMT

complexity, representation, space invader, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > France (0.04)
(2 more...)

Industry:

Law (0.46)
Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)

cc9b3c69b56df284846bf2432f1cba90-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 09:58:56 GMT

actor-critic algorithm, algorithm, inequality, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Asia > Middle East > Jordan (0.04)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

A Finite-Time Analysis of Two Time-Scale Actor-Critic Methods

Neural Information Processing SystemsFeb-10-2026, 09:58:48 GMT

In this work, we provide a non-asymptotic analysis for two timescale actor-critic methods under non-i.i.d.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.30)
Asia > Middle East > Jordan (0.04)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

8443219a991f068c34d9491ad68ffa94-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 09:44:22 GMT

arxiv preprint arxiv, contextual bandit, linear contextual bandit, (10 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)