RobustDeepReinforcementLearning throughAdversarialLoss

Feb-11-2026, 11:35:22 GMT–Neural Information Processing Systems

Our RADIAL-RL agents consistently outperform prior methods when tested against attacks of varying strength and are more computationally efficient to train. In addition, we propose a new evaluation method calledGreedyWorst-Case Reward(GWC) tomeasure attack agnostic robustness of deep RL agents. We show that GWC can be evaluated efficiently and is a good estimate of the reward under the worst possible sequence of adversarial attacks.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Feb-11-2026, 11:35:22 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
dbb422937d7ff56e049d61da730b3e11-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found