policy-change density
We thank reviewers (R1
Therefore, our paper could have stronger implications than we expected. We disagree with R2: the tabular form of JPS does have theoretical guarantees, as other reviewers appreciated. Full-game AI is future work. This change leads to very different (and novel) theoretical insights, including the policy-change decomposition in Thm. 1. We will add comparisons in the next version.