Review for NeurIPS paper: Deep Inverse Q-learning with Constraints

Neural Information Processing Systems 

Reviewers generally agreed that this paper proposes a novel IRL method that leverages the assumption that expert demonstrations follow a Boltzmann distribution.
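For readers unfamiliar with the assumption, the following is a minimal sketch of a Boltzmann (softmax) action distribution over action values. It is an illustration only, not the paper's implementation: the function name `boltzmann_policy`, the `temperature` parameter, and the example Q-values are all hypothetical, and the review itself does not say which quantity the distribution is defined over.

```python
import numpy as np


def boltzmann_policy(q_values, temperature=1.0):
    """Return softmax action probabilities for one state.

    Hypothetical helper: assumes the expert picks action a with
    probability proportional to exp(Q(s, a) / temperature).
    """
    q = np.asarray(q_values, dtype=float) / temperature
    # Subtract the maximum before exponentiating for numerical stability;
    # this does not change the resulting probabilities.
    q = q - q.max()
    weights = np.exp(q)
    return weights / weights.sum()


# Made-up Q-values for a single state with three actions.
q_example = [1.0, 2.0, 3.0]
probs = boltzmann_policy(q_example)

# Under this assumption, log-probability differences between two actions
# equal the differences of their Q-values (at temperature 1).
log_ratio = np.log(probs[2]) - np.log(probs[0])
```

Under such an assumption, observed action frequencies carry information about differences in action values, which is the kind of structure an IRL method can exploit.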