Topological Detection of Trojaned Neural Networks
–Neural Information Processing Systems
Deep neural networks are known to have security issues. One particular threat is the Trojan attack. It occurs when the attackers stealthily manipulate the model's behavior through Trojaned training samples, which can later be exploited. Guided by basic neuroscientific principles, we discover subtle -- yet critical -- structural deviation characterizing Trojaned models. In our analysis we use topological tools.
Neural Information Processing Systems
Jan-16-2025, 18:50:46 GMT
- Technology: