Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information
–Neural Information Processing Systems
One principal approach for illuminating a black-box neural network is feature attribution, i.e. identifying the importance of input features for the network's prediction. The predictive information of features is recently proposed as a proxy for the measure of their importance. So far, the predictive information is only identified for latent features by placing an information bottleneck within the network. We propose a method to identify features with predictive information in the input domain. The method results in fine-grained identification of input features' information and is agnostic to network architecture.
fine-grained neural network explanation, identifying input feature, predictive information, (1 more...)
Neural Information Processing Systems
Jan-18-2025, 13:07:21 GMT
- Technology: