Review for NeurIPS paper: Joint Policy Search for Multi-agent Collaboration with Imperfect Information

Neural Information Processing Systems 

This paper presents the concept of policy density change for collaborative imperfect information games. All the reviewers agree that the idea is novel, appreciating the results in small games and in a much larger game of bridge (in particular, a comparison vs. WBridge5). There are several problems identified that the reviewers agree to be characterized as minor enough to be address in the final copy. As noted, there are problems with the comparison to WBridge5 and the authors have agreed to change their claim as a result. Clarifications on the connections to CFR and subgame decomposition should be made.