Review for NeurIPS paper: Multi-agent active perception with prediction rewards

Neural Information Processing Systems 

This paper addresses the problem of multiagent active perception, a somewhat nascent area, and proposes a new reformulation of Dec-rho-POMDPs into a DEC-POMDP though the addition of a final-stage "predictive action." The reviewers appreciated the novelty of this contribution as well as the theoretical analysis/loss bounds. The original reviews raised a number of questions however, and the author response addressed many of these. However, there remain some issues that undercut the significance of the contribution, including: the somewhat incremental combination/adaptation of existing techniques; the fact that the claimed scalability is not demonstrated very convincingly in the experiments; among others. On my reading of the paper, I largely concur and do not reiterate the positive contributions in the other reviews, but point out some concerns about importance/impact: 1.