Improving Deep Learning Interpretability by Saliency Guided Training
–Neural Information Processing Systems
Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in unfaithful feature attributions.
Neural Information Processing Systems
May-23-2025, 09:32:11 GMT