MADIFF: OfflineMulti-agentLearning withDiffusionModels
–Neural Information Processing Systems
Offline reinforcement learning (RL) aims to learn policies from pre-existing datasets without further interactions, making it a challenging task. Q-learning algorithms struggle withextrapolation errors inofflinesettings, while supervised learning methods are constrained by model expressiveness.
Neural Information Processing Systems
Feb-7-2026, 12:25:23 GMT
- Country:
- North America > United States
- California > Santa Clara County > Palo Alto (0.04)
- Asia > China
- North America > United States
- Genre:
- Workflow (0.46)
- Research Report (0.46)
- Technology: