RobustDeepReinforcementLearning throughAdversarialLoss

Neural Information Processing Systems 

Our RADIAL-RL agents consistently outperform prior methods when tested against attacks of varying strength and are more computationally efficient to train. In addition, we propose a new evaluation method calledGreedyWorst-Case Reward(GWC) tomeasure attack agnostic robustness of deep RL agents. We show that GWC can be evaluated efficiently and is a good estimate of the reward under the worst possible sequence of adversarial attacks.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found