BIRD: Generalizable Backdoor Detection and Removal for Deep Reinforcement Learning

Neural Information Processing Systems 

By analyzing the unique properties and behaviors of backdoor attacks, we formulate trigger restoration as an optimization problem and design a novel metric to detect backdoored policies.