AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

45e15bae91a6f213d45e203b8a29be48-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 16:05:01 GMT

data mining, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia > Arlington County > Arlington (0.04)
North America > United States > Massachusetts > Norfolk County > Wellesley (0.04)
North America > United States > Arizona > Maricopa County > Scottsdale (0.04)
(3 more...)

Genre:

Research Report (0.47)
Workflow (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)
Information Technology > Data Science > Data Mining > Big Data (0.46)

Add feedback

Non-AsymptoticAnalysisforTwoTime-scaleTDC withGeneralSmoothFunctionApproximation

Neural Information Processing SystemsFeb-8-2026, 15:55:57 GMT

Temporaldifference(TD)learning algorithm is one of the most popular policy evaluation approaches.

approximation, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Utah (0.04)
North America > United States > New York > Erie County > Buffalo (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Massachusetts > Middlesex County > Belmont (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Percentile Criterion Optimization in Offline Reinforcement Learning

Neural Information Processing SystemsFeb-8-2026, 15:55:28 GMT

In reinforcement learning, robust policies for high-stakes decision-making problems with limited data are usually computed by optimizing the percentile criterion .

ambiguity, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
(4 more...)

Genre: Research Report > New Finding (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

455e1e30edf721bd7fa334fffabdcad8-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 15:54:39 GMT

algorithm, dataset, sequence, (16 more...)

Neural Information Processing Systems

Industry: Health & Medicine (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Temporally-ConsistentSurvivalAnalysis

Neural Information Processing SystemsFeb-8-2026, 15:54:35 GMT

Wemodel theeventofinterest asaspecial terminal state, andwe seek to estimate the survival distribution (i.e., the distribution of the hitting time for that terminal state) from anyother state.

artificial intelligence, machine learning, reinforcement learning, (20 more...)

Neural Information Processing Systems

Country: