Review for NeurIPS paper: Strictly Batch Imitation Learning by Energy-based Distribution Matching
–Neural Information Processing Systems
All reviewers unanimously agree that the paper makes a nice contribution to imitation learning in the batch setting. That said, the paper has two major weaknesses: 1. During the discussion, the reviewers expressed confidence that the authors understand the mistake and know how to address it (see e.g., the post-rebuttal update of R4). Therefore, we are recommending acceptance conditioned on that the authors take this issue seriously, correct the technical mistake, and remove any incorrect or misleading claims associated with it. The authors are strongly recommended to add such a comparison in the camera-ready version. On a related note, while the algorithm only uses (s,a) pairs as data, trajectory data is often available, from which one can extract (s,a,r,s') pairs.
Neural Information Processing Systems
Jan-24-2025, 10:28:56 GMT
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (0.98)
- Robots (0.65)
- Information Technology > Artificial Intelligence