AITopics | Reinforcement Learning

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

d0d5dd7bd2ee9f095e50084c2ba3a716-Paper-Conference.pdf

Neural Information Processing Systems, Feb-12-2026, 01:36:53 GMT

algorithm, imitation, learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Arizona (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)

Genre:

Instructional Material > Online (0.50)
Research Report (0.46)

Industry: Education > Educational Setting > Online (0.49)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)

Model-based Lifelong Reinforcement Learning with Bayesian Exploration

Neural Information Processing Systems, Feb-12-2026, 01:36:30 GMT

This optimization can be performed in parallel for each s, keeping t 1 fixed.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.51)

fd0a5a5e367a0955d81278062ef37429-Paper.pdf

Neural Information Processing Systems, Feb-12-2026, 01:12:59 GMT

counterfactual explanation, explanation, realization, (15 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > Strength High (0.93)
Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)
Health & Medicine > Therapeutic Area > Internal Medicine (0.46)
Health & Medicine > Therapeutic Area > Immunology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (0.35)

fd06b8ea02fe5b1c2496fe1700e9d16c-Paper.pdf

Neural Information Processing Systems, Feb-12-2026, 01:12:20 GMT

artificial intelligence, international conference, representation, (10 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.85)

Provable Representation Learning for Imitation with Contrastive Fourier Features

Neural Information Processing Systems, Feb-12-2026, 01:11:08 GMT

In this work, we focus on imitation learning, where the aim is to learn how to act in the environment to match the behavior of some unknown target policy [25].

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Genre: Instructional Material > Course Syllabus & Notes (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)

Worst-Case Regret Bounds for Exploration via Randomized Value Functions

Daniel Russo

Neural Information Processing Systems, Feb-12-2026, 01:03:50 GMT

This paper studies a recent proposal to use randomized value functions to drive exploration in reinforcement learning.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.35)

Learning Large Neighborhood Search Policy for Integer Programming

Neural Information Processing Systems, Feb-12-2026, 01:03:27 GMT

However, the combinatorial number of variable subsets prevents direct application of typical RL algorithms.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.05)
Asia > China > Shandong Province > Qingdao (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.89)

d01bda31bbcd780774ff15b534e03c40-Supplemental-Conference.pdf

Neural Information Processing Systems, Feb-12-2026, 01:03:20 GMT

Reinforcement learning (RL) algorithms often require a large number of data samples to learn a control policy. As a result, training them directly on real-world systems is expensive and potentially dangerous.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: