Review for NeurIPS paper: Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations

Feb-8-2025, 01:46:23 GMT–Neural Information Processing Systems

Clarity: *** Derivations in Section 3 *** While the theorems in Section 3.1 seem reasonable, I would have liked a more self-contained presentation of the theorems together with their proofs. Assumption 2 (Bounded adversary power) is a bit strange: while the experimental implementation (a norm ball around s) seems reasonable for many environments, the assumption itself should be defined more precisely. The authors refer to the Appendix a lot, and in my opinion such derivations are necessary for the reader to follow along. As it stands, I cannot really follow how the authors get there. Add plots (similar to Appendix I, Figure 12).
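To make the request about Assumption 2 concrete, the norm-ball version used in the experiments could be stated along the following lines. This is only a sketch: the symbols B(s), \nu, p, and \epsilon are assumed notation for the perturbation set, the adversary, the norm order, and the radius, and may not match the paper's.

```latex
% Hypothetical notation: B(s) is the set of observations the adversary may present
% in place of the true state s; \nu is the adversary's perturbation map.
\[
  B(s) = \{\, \hat{s} \in \mathcal{S} : \lVert \hat{s} - s \rVert_p \le \epsilon \,\},
  \qquad
  \nu(s) \in B(s) \quad \text{for all } s \in \mathcal{S}.
\]
```

Stating the general assumption first and then giving the norm ball as one instance of it would keep the experimental choice separate from the theoretical requirement.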

adversarial perturbation, robust deep reinforcement learning, state observation, (5 more...)

