Goto

Collaborating Authors

 policy-change density



We thank reviewers (R1

Neural Information Processing Systems

Therefore, our paper could have stronger implications than we expect. We disagree with R2 that the tabular form of JPS indeed has theoretical guarantees, as appreciated by other reviewers. Full game AI is a future work. This change leads to very different (and novel) theoretical insights. It leads to policy-change decomposition in Thm. 1, We will add comparisons in the next version.