Review for NeurIPS paper: Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Jan-25-2025, 19:08:02 GMT–Neural Information Processing Systems

The proof of your theory lacks a discussion of the POMDP setting. Although the framework is focused on solving the Dec-POMDP problem, most parts of the proof are carried out under the MDP setting, and there is no further discussion of this discrepancy. The use of weighting is not that convincing. In Section 6.2.3, the performance of the Weighted QMIX method is unacceptable.
