Review for NeurIPS paper: Security Analysis of Safe and Seldonian Reinforcement Learning Algorithms

Jan-25-2025, 03:53:40 GMT–Neural Information Processing Systems

Weaknesses: W1: The study seems to focus too much on algorithms that are based on safety tests. I understand that the analysis is not compatible, but maybe that would be worth it to include studies on how easy it is to trick those algorithms too. More generally (even for IS algorithms), it was a bit odd to me that the study does not consider attacks on the way pi_e is chosen. W2: It's unclear to me whether the trajectory must still have been performed in the real environment, or it can be completely be made up (but then its value has to be within the range [0,1]). Also, with model based methods (for both environment and policy models), it might be possible to single out the few trajectories that are inconsistent with the other trajectories.

security analysis, seldonian reinforcement learning algorithm, trajectory, (3 more...)

Neural Information Processing Systems

Jan-25-2025, 03:53:40 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.85)