Online Robust Policy Learning in the Presence of Unknown Adversaries

Aaron Havens, Zhanhong Jiang, Soumik Sarkar

Neural Information Processing Systems 

Recent work on generating adversarial attacks have shown that it is computationally feasible for a bad actor to fool a DRL policy into behaving sub optimally.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found