AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

82039d16dce0aab3913b6a7ac73deff7-Paper.pdf

Neural Information Processing SystemsFeb-9-2026, 04:15:33 GMT

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > California > San Diego County > San Diego (0.04)
North America > Canada (0.04)

Industry: Leisure & Entertainment > Games (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.97)

Add feedback

81e793dc8317a3dbc3534ed3f242c418-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 04:13:51 GMT

Leveraging themodel-based nature ofDisCo,wecanalso readily compute anε/cmin-optimal policy for any cost-sensitive shortest-path problem defined on theL-controllable states with minimum costcmin.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.46)

Add feedback

58b286aea34a91a3d33e58af0586fa40-Paper-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 04:01:19 GMT

algorithm, graph, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country:

Asia > South Korea > Seoul > Seoul (0.04)
North America > United States (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.32)

Add feedback

5833b4daf5b076dd1cdb362b163dff0c-Paper-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 03:59:40 GMT

international conference, mdp, task distribution, (13 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel (0.04)
North America > United States > Massachusetts (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

68331ff0427b551b68e911eebe35233b-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 03:59:34 GMT

learner, reward feature, successor feature, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California (0.04)
(2 more...)

Industry: Education (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

68264bdb65b97eeae6788aa3348e553c-Paper.pdf

Neural Information Processing SystemsFeb-9-2026, 03:46:13 GMT

mapping function, source domain, target domain, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.66)

Add feedback

Anchor-Changing Regularized Natural Policy Gradientfor Multi-Objective Reinforcement Learning

Neural Information Processing SystemsFeb-9-2026, 03:44:36 GMT

Let = betheoptimalpolicyofthe CMDPproblemin (9). Theorem 3.ForanyK 1, takeuniformpolicy 0, 0 16 , 6 (1 )3, = 1 , and tk =d 11 log (5LK6 log (|A|))+1 e.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Texas > Brazos County > College Station (0.05)

Industry: Government > Regional Government > North America Government > United States Government (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Add feedback