Multi-Agent Advisor Q-Learning
Subramanian, Sriram Ganapathi, Taylor, Matthew E., Larson, Kate, Crowley, Mark
–arXiv.org Artificial Intelligence
In the last decade, there have been significant advances in multi-agent reinforcement learning (MARL) but there are still numerous challenges, such as high sample complexity and slow convergence to stable policies, that need to be overcome before wide-spread deployment is possible. However, many real-world environments already, in practice, deploy sub-optimal or heuristic approaches for generating policies. An interesting question which arises is how to best use such approaches as advisors to help improve reinforcement learning in multi-agent domains. In this paper, we provide a principled framework for incorporating action recommendations from online sub-optimal advisors in multi-agent settings. We describe the problem of ADvising Multiple Intelligent Reinforcement Agents (ADMIRAL) in nonrestrictive general-sum stochastic game environments and present two novel Q-learning based algorithms: ADMIRAL - Decision Making (ADMIRAL-DM) and ADMIRAL - Advisor Evaluation (ADMIRAL-AE), which allow us to improve learning by appropriately incorporating advice from an advisor (ADMIRAL-DM), and evaluate the effectiveness of an advisor (ADMIRAL-AE). We analyze the algorithms theoretically and provide fixed-point guarantees regarding their learning in general-sum stochastic games. Furthermore, extensive experiments illustrate that these algorithms: can be used in a variety of environments, have performances that compare favourably to other related baselines, can scale to large state-action spaces, and are robust to poor advice from advisors.
arXiv.org Artificial Intelligence
Nov-8-2021
- Country:
- South America
- Brazil > São Paulo (0.04)
- Argentina > Pampas
- Buenos Aires F.D. > Buenos Aires (0.04)
- Oceania
- New Zealand > North Island
- Auckland Region > Auckland (0.04)
- Australia
- Queensland > Brisbane (0.04)
- Victoria > Melbourne (0.04)
- New South Wales > Sydney (0.04)
- Australian Capital Territory > Canberra (0.04)
- New Zealand > North Island
- North America
- United States
- District of Columbia > Washington (0.04)
- Rocky Mountains (0.04)
- Colorado > Denver County
- Denver (0.14)
- Florida > Broward County
- Fort Lauderdale (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- New York
- New York County > New York City (0.14)
- Richmond County > New York City (0.04)
- Queens County > New York City (0.04)
- Kings County > New York City (0.04)
- Bronx County > New York City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Tennessee > Davidson County
- Nashville (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Washington > King County
- Bellevue (0.04)
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- California
- Los Angeles County > Long Beach (0.14)
- Santa Barbara County > Santa Barbara (0.04)
- Santa Clara County
- Stanford (0.04)
- Mountain View (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada
- Rocky Mountains (0.04)
- Quebec
- Montreal (0.04)
- Capitale-Nationale Region
- Québec (0.04)
- Quebec City (0.04)
- Ontario > Waterloo Region
- Waterloo (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- Alberta
- United States
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Sweden
- Stockholm > Stockholm (0.04)
- Skåne County > Malmö (0.04)
- Spain
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Andalusia > Granada Province
- Granada (0.04)
- Catalonia > Barcelona Province
- Slovenia > Upper Carniola
- Municipality of Bled > Bled (0.04)
- Italy > Sardinia
- Cagliari (0.04)
- Germany > Baden-Württemberg
- Tübingen Region > Tübingen (0.04)
- France
- Île-de-France > Paris
- Paris (0.04)
- Grand Est > Meurthe-et-Moselle
- Nancy (0.04)
- Île-de-France > Paris
- United Kingdom > England
- Asia
- Taiwan > Taiwan Province
- Taipei (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Middle East
- Japan > Honshū
- Chūgoku > Hiroshima Prefecture > Hiroshima (0.04)
- Taiwan > Taiwan Province
- South America
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: