Hierarchical Interpretation of Neural Text Classification
Yan, Hanqi, Gui, Lin, He, Yulan
arXiv.org Artificial Intelligence
Recent years have witnessed increasing interest in developing interpretable models in Natural Language Processing (NLP). Most existing models aim to identify input features, such as words or phrases, that are important for model predictions. Neural models developed in NLP, however, often compose word semantics in a hierarchical manner, and text classification requires hierarchical modelling to aggregate local information in order to deal with topic and label shifts more effectively. As such, interpretation by words or phrases alone cannot faithfully explain model decisions in text classification. This paper proposes a novel Hierarchical INTerpretable neural text classifier, called Hint, which can automatically generate explanations of model predictions in the form of label-associated topics in a hierarchical manner. Model interpretation is no longer at the word level, but built on topics as the basic semantic unit. Experimental results on both review datasets and news datasets show that our proposed approach achieves text classification results on par with existing state-of-the-art text classifiers, and generates interpretations that are more faithful to model predictions and better understood by humans than those of other interpretable neural text classifiers.
Aug-9-2022
- Country:
  - North America
    - United States
      - Texas > McLennan County
        - Waco (0.04)
      - New York > New York County
        - New York City (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - California > San Diego County
        - San Diego (0.04)
    - Canada > Alberta
  - Europe
    - United Kingdom > England
      - Greater London > London (0.04)
    - Italy > Tuscany
      - Florence (0.04)
  - Asia
    - Middle East > Jordan (0.04)
    - China > Hong Kong (0.04)
  - Africa > Ethiopia
    - Addis Ababa > Addis Ababa (0.04)
- Genre:
  - Research Report > New Finding (0.93)
- Industry:
  - Media > Film (1.00)
  - Leisure & Entertainment (1.00)
  - Health & Medicine
    - Therapeutic Area (1.00)
    - Health Care Providers & Services (1.00)
    - Consumer Health (0.67)
- Technology: