ladv
RobustDeepReinforcementLearning throughAdversarialLoss
Our RADIAL-RL agents consistently outperform prior methods when tested against attacks of varying strength and are more computationally efficient to train. In addition, we propose a new evaluation method calledGreedyWorst-Case Reward(GWC) tomeasure attack agnostic robustness of deep RL agents. We show that GWC can be evaluated efficiently and is a good estimate of the reward under the worst possible sequence of adversarial attacks.
Country:
- Oceania > Australia > New South Wales > Sydney (0.04)
- North America > United States (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)