AITopics | Reinforcement Learning

BC formulates imitation learning as a supervised learning problem. It needs no in-environment samples, but it suffers from the covariate shift issue [37], often leading totesttimeperformance degradation.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)

Add feedback

11958dfee29b6709f48a9ba0387a2431-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 12:47:13 GMT

disjunctive graph, opération, scheduling problem, (12 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
North America > United States (0.04)
North America > Canada (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.90)

Add feedback

Multi

Neural Information Processing SystemsFeb-7-2026, 12:44:41 GMT

However, there are still quite a few challenges between the traditional RL research and real-worldtasks.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)

Add feedback

114292cf3f930ba157ed33f66997fee2-Paper-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 12:44:30 GMT

We characterise the phenomenon empirically, verifying that it is not limited to specific algorithm or environment properties.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Greater London > London (0.05)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)

Add feedback

0f3d014eead934bbdbacb62a01dc4831-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 12:36:25 GMT

affordance, option model, temporally, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)

Industry: Transportation > Passenger (0.31)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

10a6bdcabbd5a3d36b760daa295f63c1-Paper-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 12:27:00 GMT

arxiv preprint arxiv, learning, reward function, (13 more...)

Neural Information Processing Systems

Country: Europe > Switzerland (0.04)

Industry:

Leisure & Entertainment > Games (0.68)
Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Data Science (0.93)
(3 more...)

Add feedback

MADIFF: OfflineMulti-agentLearning withDiffusionModels

Neural Information Processing SystemsFeb-7-2026, 12:25:23 GMT

Offline reinforcement learning (RL) aims to learn policies from pre-existing datasets without further interactions, making it a challenging task. Q-learning algorithms struggle withextrapolation errors inofflinesettings, while supervised learning methods are constrained by model expressiveness.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country: