Reviews: Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization

Oct-7-2024, 10:49:17 GMT–Neural Information Processing Systems

The article extends previous work on primal-dual optimisation for policy evaluation in RL to the distributed policy evaluation setting, maintaining attractive convergence rates for the extended algorithm. Overall, the article builds its contribution gradually and is reasonably easy to follow. A few exceptions to this are the start of the related work, dropping citations in lists, and the lack of an explanation of the repeatedly mentioned 'convex-concave saddle-point problem'. The authors equate averaging over 'agents' with averaging over 'space', which in my view is a somewhat imprecise metaphorical stretch. The contribution is honestly delineated (collaborative distributed policy evaluation with local rewards), and relevant related work is cited clearly.

double averaging primal-dual optimization, multi-agent reinforcement learning, policy evaluation

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.85)