Provably Efficient Black-Box Action Poisoning Attacks Against Reinforcement Learning

Oct-10-2024, 21:36:42 GMT–Neural Information Processing Systems

Due to the broad range of applications of reinforcement learning (RL), understanding the effects of adversarial attacks against RL model is essential for the safe applications of this model. Prior theoretical works on adversarial attacks against RL mainly focus on either reward poisoning attacks or environment poisoning attacks. In this paper, we introduce a new class of attacks named action poisoning attacks, where an adversary can change the action signal selected by the agent. Compared with existing attack models, the attacker's ability in the proposed action poisoning attack model is more restricted, which brings some design challenges. We study the action poisoning attack in both white-box and black-box settings.

agent, efficient black-box action poisoning attack, reinforcement learning, (7 more...)

Neural Information Processing Systems

Oct-10-2024, 21:36:42 GMT

Conferences Web Page

Add feedback

Industry:
- Transportation > Air (0.72)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)