Extending Q-Learning to General Adaptive Multi-Agent Systems
Recent multi-agent extensions of Q-Learning require knowledge of other agents' payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This paper proposes a fundamentally different approach, dubbed "Hyper-Q" Learning, in which values of mixed strategies rather than base actions are learned, and in which other agents' strategies are estimated from observed actions via Bayesian inference. Hyper-Q may be effective against many different types of adaptive agents, even if they are persistently dynamic. Against certain broad categories of adaptation, it is argued that Hyper-Q may converge to exact optimal time-varying policies. In tests using Rock-Paper-Scissors, Hyper-Q learns to significantly exploit an Infinitesimal Gradient Ascent (IGA) player, as well as a Policy Hill Climber (PHC) player. Preliminary analysis of Hyper-Q against itself is also presented.
Neural Information Processing Systems
Dec-31-2004
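
The abstract describes the core mechanics of Hyper-Q: Q-values are learned over mixed strategies rather than base actions, and the opponent's mixed strategy is estimated from observed actions via Bayesian inference. The sketch below illustrates one possible reading of that loop on Rock-Paper-Scissors; it is not the paper's implementation. The grid discretization of the strategy simplex, the Dirichlet-style opponent estimate, the stationary stand-in opponent, and all constants (N, alpha, gamma, epsilon) are assumptions made for illustration; the paper's experiments use IGA and PHC opponents instead.

```python
import numpy as np

# Minimal Hyper-Q-style sketch for Rock-Paper-Scissors (illustrative only).
# Assumptions not taken from the paper: a coarse grid over the 3-action
# strategy simplex, a Dirichlet posterior for the opponent estimate, and a
# fixed-mixture stand-in opponent in place of IGA/PHC.

rng = np.random.default_rng(0)

# Discretize the probability simplex over 3 actions with resolution N.
N = 10
GRID = np.array([[i, j, N - i - j]
                 for i in range(N + 1) for j in range(N + 1 - i)]) / N

def nearest(p):
    """Index of the grid point closest to probability vector p."""
    return int(np.argmin(np.linalg.norm(GRID - p, axis=1)))

# Q-table indexed by (estimated opponent mixed strategy, own mixed strategy).
Q = np.zeros((len(GRID), len(GRID)))
alpha, gamma, epsilon = 0.1, 0.9, 0.1

# Payoff to the learner: rows = own action, columns = opponent action.
PAYOFF = np.array([[0, -1, 1],
                   [1, 0, -1],
                   [-1, 1, 0]])

# Dirichlet(1,1,1) prior over the opponent's action distribution.
counts = np.ones(3)

def opponent_action():
    """Stand-in opponent with a fixed mixture (the paper uses IGA/PHC)."""
    return rng.choice(3, p=[0.5, 0.3, 0.2])

y = nearest(counts / counts.sum())          # current opponent-strategy estimate
for t in range(5000):
    # Epsilon-greedy choice of own mixed strategy given the current estimate.
    x = rng.integers(len(GRID)) if rng.random() < epsilon else int(np.argmax(Q[y]))
    my_a = rng.choice(3, p=GRID[x])
    opp_a = opponent_action()
    r = PAYOFF[my_a, opp_a]

    # Bayesian (Dirichlet) update of the opponent-strategy estimate.
    counts[opp_a] += 1
    y_next = nearest(counts / counts.sum())

    # Q-learning update over (opponent estimate, own mixed strategy) pairs.
    Q[y, x] += alpha * (r + gamma * Q[y_next].max() - Q[y, x])
    y = y_next

print("Greedy mixed strategy against current estimate:", GRID[int(np.argmax(Q[y]))])
```

Against the fixed-mixture stand-in opponent here, the greedy strategy should drift toward the best response to the estimated mixture; exploiting adaptive opponents such as IGA or PHC, as reported in the abstract, requires tracking a time-varying estimate rather than a converging one.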