Policy Invariance under Reward Transformations for General-Sum Stochastic Games

Jul-29-2011–Journal of Artificial Intelligence Research

We extend the potential-based shaping method from Markov decision processes to multi-player general-sum stochastic games. We prove that the Nash equilibria in a stochastic game remains unchanged after potential-based shaping is applied to the environment. The property of policy invariance provides a possible way of speeding convergence when learning to play a stochastic game.

equilibrium policy, matrix game, stochastic game, (12 more...)

Journal of Artificial Intelligence Research

Jul-29-2011

Journals PDF

Add feedback

Country:
- North America
  - United States
    - Pennsylvania > Allegheny County
      - Pittsburgh (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
  - Canada > Ontario
    - National Capital Region > Ottawa (0.14)
    - Kingston (0.04)
- Europe > United Kingdom
  - England > Greater London > London (0.04)
- Asia > Taiwan
  - Taiwan Province > Taipei (0.04)

Industry:
- Leisure & Entertainment (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (0.95)
  - Representation & Reasoning > Agents (0.94)