Manipulating Feature Visualizations with Gradient Slingshots
Neural Information Processing Systems
Feature Visualization (FV) is a widely used technique for interpreting concepts learned by Deep Neural Networks (DNNs): it synthesizes input patterns that maximally activate a given feature. Despite its popularity, the trustworthiness of FV explanations has received limited attention. We introduce Gradient Slingshots, a novel method that enables FV manipulation without modifying the model architecture or significantly degrading its performance.
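To make the idea of "synthesizing an input that maximally activates a feature" concrete, the following is a minimal sketch of activation maximization by projected gradient ascent. It is not the Gradient Slingshots method and not the paper's code: the toy feature (a tanh of a weighted sum), the weights `w`, and the function names `feature_activation`, `feature_gradient`, and `visualize_feature` are all assumptions made for illustration.

```python
import math

def feature_activation(x, w):
    """Toy 'feature' (assumed for illustration): tanh of a weighted sum."""
    return math.tanh(sum(wi * xi for wi, xi in zip(w, x)))

def feature_gradient(x, w):
    """Analytic gradient of the toy feature with respect to the input x."""
    a = feature_activation(x, w)
    return [(1.0 - a * a) * wi for wi in w]

def visualize_feature(w, steps=200, lr=0.1, radius=1.0):
    """Gradient ascent on the input, projected onto an L2 ball.

    The norm constraint stands in for the regularizers that practical
    FV pipelines use to keep the synthesized input bounded.
    """
    x = [0.0] * len(w)  # start from a neutral input
    for _ in range(steps):
        g = feature_gradient(x, w)
        x = [xi + lr * gi for xi, gi in zip(x, g)]  # ascent step
        norm = math.sqrt(sum(xi * xi for xi in x))
        if norm > radius:  # project back onto the ball
            x = [xi * radius / norm for xi in x]
    return x

w = [0.6, -0.8]  # hypothetical feature weights with unit norm
x_star = visualize_feature(w)
print(x_star, feature_activation(x_star, w))
```

In this toy setting the synthesized input aligns with the weight vector, which is the pattern the feature responds to most strongly. With a real DNN, the analytic gradient would be replaced by automatic differentiation through the network.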
Jun-14-2026, 07:32:07 GMT