RobustDeepReinforcementLearning throughAdversarialLoss
–Neural Information Processing Systems
Our RADIAL-RL agents consistently outperform prior methods when tested against attacks of varying strength and are more computationally efficient to train. In addition, we propose a new evaluation method calledGreedyWorst-Case Reward(GWC) tomeasure attack agnostic robustness of deep RL agents. We show that GWC can be evaluated efficiently and is a good estimate of the reward under the worst possible sequence of adversarial attacks.
Neural Information Processing Systems
Feb-11-2026, 11:35:22 GMT
- Technology: