ResQ: A Residual Q Function-based Approach for Multi-Agent Reinforcement Learning Value Factorization
–Neural Information Processing Systems
The factorization of state-action value functions for Multi-Agent Reinforcement Learning (MARL) is important. Existing studies are limited by their representation capability, sample efficiency, and approximation error. To address these challenges, we propose, ResQ, a MARL value function factorization method, which can find the optimal joint policy for any state-action value function through residual functions. ResQ masks some state-action value pairs from a joint state-action value function, which is transformed as the sum of a main function and a residual function. ResQ can be used with mean-value and stochastic-value RL.
artificial intelligence, machine learning, multi-agent reinforcement learning value factorization, (6 more...)
Neural Information Processing Systems
Jan-24-2025, 23:29:39 GMT
- Technology: