Goto

Collaborating Authors

 Reinforcement Learning





11715d433f6f8b9106baae0df023deb3-Paper-Conference.pdf

Neural Information Processing Systems

BC formulates imitation learning as a supervised learning problem. It needs no in-environment samples, but it suffers from the covariate shift issue [37], often leading totesttimeperformance degradation.



Multi

Neural Information Processing Systems

However, there are still quite a few challenges between the traditional RL research and real-worldtasks.





MADIFF: OfflineMulti-agentLearning withDiffusionModels

Neural Information Processing Systems

Offline reinforcement learning (RL) aims to learn policies from pre-existing datasets without further interactions, making it a challenging task. Q-learning algorithms struggle withextrapolation errors inofflinesettings, while supervised learning methods are constrained by model expressiveness.