A Missing statements and proofs

A.1 Statements for Section 3.1

Neural Information Processing Systems 

Consider a two-player Markov game in which both players affect the transition. As we have seen in Section 2.1, in the case of unilateral deviation from joint policy

Let $\hat{\sigma}$ be a (possibly correlated) joint policy. By Lemma A.1, we know that

where the equality holds due to the zero-sum property, (1).

An approximate NE is an approximate global minimum.

An approximate global minimum is an approximate NE.
