Policy Gradient With Value Function Approximation For Collective Multiagent Planning
Duc Thien Nguyen, Akshat Kumar, Hoong Chuin Lau
Neural Information Processing Systems
Apr-23-2026, 19:21:52 GMT