Fully Decentralized Policies for Multi-Agent Systems: An Information Theoretic Approach
Dobbe, Roel; Fridovich-Keil, David; Tomlin, Claire
Neural Information Processing Systems
Learning cooperative policies for multi-agent systems is often challenged by partial observability and a lack of coordination. In some settings, the structure of a problem allows a distributed solution with limited communication. Here, we consider a scenario where no communication is available, and instead we learn local policies for all agents that collectively mimic the solution to a centralized multi-agent static optimization problem. Our main contribution is an information-theoretic framework based on rate-distortion theory, which facilitates analysis of how well the resulting fully decentralized policies are able to reconstruct the optimal solution. Moreover, this framework provides a natural extension that addresses which nodes an agent should communicate with to improve the performance of its individual policy.
Dec-31-2017
- Country:
  - Europe > Italy
  - North America
    - Canada > British Columbia
    - The Bahamas > New Providence
      - Nassau (0.04)
    - United States
      - Arizona (0.04)
      - California
        - Alameda County > Berkeley (0.14)
        - Los Angeles County
          - Long Beach (0.04)
          - Los Angeles (0.14)
        - San Francisco County > San Francisco (0.14)
      - Massachusetts > Suffolk County
        - Boston (0.04)
      - New York > New York County
        - New York City (0.04)
- Industry:
  - Energy
    - Power Industry (1.00)
    - Renewable (0.93)