Policy Invariance under Reward Transformations for General-Sum Stochastic Games
Lu, X., Schwartz, H. M., Givigi, S. N.
–Journal of Artificial Intelligence Research
We extend the potential-based shaping method from Markov decision processes to multi-player general-sum stochastic games. We prove that the Nash equilibria in a stochastic game remains unchanged after potential-based shaping is applied to the environment. The property of policy invariance provides a possible way of speeding convergence when learning to play a stochastic game.
Journal of Artificial Intelligence Research
Jul-29-2011
- Country:
- Asia > Taiwan
- Taiwan Province > Taipei (0.04)
- Europe > United Kingdom
- England > Greater London > London (0.04)
- North America
- Canada > Ontario
- Kingston (0.04)
- National Capital Region > Ottawa (0.14)
- United States
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Massachusetts > Middlesex County
- Canada > Ontario
- Asia > Taiwan
- Industry:
- Technology: