AITopics | Reinforcement Learning

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics


Offline RL Without Off-Policy Evaluation

Neural Information Processing Systems, Feb-7-2026, 22:53:31 GMT

In addition, we hypothesize that the strong performance of the one-step algorithm is due to a combination of favorable structure in the environment and behavior policy.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

2c6a0bae0f071cbbf0bb3d5b11d90a82-Supplemental.pdf

Neural Information Processing Systems, Feb-7-2026, 22:53:00 GMT

agent, dqn, m-dqn, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Russia (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

2be5f9c2e3620eb73c2972d7552b6cb5-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 22:43:08 GMT

mdp homomorphism, symmetry, transformation, (10 more...)

Neural Information Processing Systems

Country:

Europe > Netherlands > North Holland > Amsterdam (0.05)
North America > United States > Massachusetts (0.04)
North America > Canada (0.04)
Europe > Netherlands > South Holland > Delft (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets

Neural Information Processing Systems, Feb-7-2026, 22:35:47 GMT

When the collected dataset is good, i.e., mostly

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Government (0.67)
Information Technology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

2bba9f4124283edd644799e0cecd45ca-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 22:35:34 GMT

The problem of inverse reinforcement learning (IRL) is relevant to a variety of tasks including value alignment and robot learning from demonstration.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

249f73e01f0a2bb6c8d971b565f159a7-Paper-Conference.pdf

Neural Information Processing Systems, Feb-7-2026, 22:24:29 GMT

algorithm, evaluation, marl, (15 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.05)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.75)

2456a42386e445ba884511aa17ca4a30-Paper-Conference.pdf

Neural Information Processing Systems, Feb-7-2026, 22:15:03 GMT

machine learning, reinforcement learning, resq, (15 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.99)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

256bf8e6923a52fda8ddf7dc050a1148-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 22:13:46 GMT

One method of obtaining such a representation uses the notion of a bisimulation metric (BSM) [13, 14].

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia > Arlington County > Arlington (0.04)
North America > Canada > Ontario > Toronto (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)

29e48b79ae6fc68e9b6480b677453586-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 22:12:59 GMT

algorithm, attendlight, intersection, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > North Carolina > Wake County > Cary (0.05)
Asia > China > Zhejiang Province > Hangzhou (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry:

Health & Medicine (0.68)
Transportation > Infrastructure & Services (0.50)
Transportation > Ground > Road (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

294e09f267683c7ddc6cc5134a7e68a8-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 21:45:22 GMT

arxiv preprint arxiv, coda, learning, (11 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Florida > Palm Beach County > Boca Raton (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.72)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)
(2 more...)