Reviews: Neural Interaction Transparency (NIT): Disentangling Learned Interactions for Improved Interpretability

Oct-7-2024, 13:53:10 GMT–Neural Information Processing Systems

This paper proposes a novel approach to more interpretable learning in neural networks. In particular, it addresses the common criticism that the computations performed by neural networks are often hard to intuitively interpret, which can be a problem in applications, e.g. in the medical or financial fields. The authors suggest adding a novel regulariser to the weights of first layer of a neural network to discover and preserve non-additive interactions in the data features up to a chosen order and preserve these relationships without entangling them together (e.g. These interactions could then be further processed by the separate columns of the neural network. The approach was evaluated on a number of datasets and seems to perform similarly to the baselines on regression and classification tasks, while being more interpretable and less computationally expensive.

disentangling learned interaction, neural interaction transparency, neural network, (8 more...)

Neural Information Processing Systems

Oct-7-2024, 13:53:10 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.57)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.90)