Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information

Jan-18-2025, 13:07:21 GMT–Neural Information Processing Systems

One principal approach for illuminating a black-box neural network is feature attribution, i.e. identifying the importance of input features for the network's prediction. The predictive information of features is recently proposed as a proxy for the measure of their importance. So far, the predictive information is only identified for latent features by placing an information bottleneck within the network. We propose a method to identify features with predictive information in the input domain. The method results in fine-grained identification of input features' information and is agnostic to network architecture.

fine-grained neural network explanation, identifying input feature, predictive information, (1 more...)

Neural Information Processing Systems

Jan-18-2025, 13:07:21 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Information Management (1.00)
  - Data Science (1.00)
  - Artificial Intelligence > Machine Learning
    - Neural Networks (0.66)