PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure
Cao, Feiqi, Han, Caren, Chung, Hyunsuk
arXiv.org Artificial Intelligence
In this work, we propose a novel tree-based explanation technique, PEACH (Pretrained-embedding Explanation Across Contextual and Hierarchical Structure), which explains how text documents are classified, using any pretrained contextual embeddings, in a tree-based, human-interpretable manner. PEACH can take the contextual embeddings of any pretrained language model (PLM) as training input for the decision tree. Using PEACH, we perform a comprehensive analysis of several contextual embeddings on nine different NLP text classification benchmarks. This analysis demonstrates the flexibility of the model by applying several PLM contextual embeddings together with different attribute selection, scaling, and clustering methods. Furthermore, we show the utility of the explanations by visualising the feature selection and the important trends of text classification via human-interpretable word-cloud-based trees, which clearly identify model mistakes and assist in dataset debugging. Besides interpretability, PEACH achieves results that are better than or comparable to those of the pretrained models.
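The core idea in the abstract, feeding document embeddings to a decision tree so that each decision can be read off as a threshold on a feature, can be illustrated with a small sketch. This is not the authors' implementation: the embeddings below are synthetic stand-ins for PLM outputs, the tree learner is a plain Gini-based one written for illustration, and the attribute selection, scaling, clustering, and word-cloud steps are omitted. All function and variable names are hypothetical.

```python
import random
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def best_split(X, y):
    """Return the (feature, threshold) pair that lowers weighted Gini, or None."""
    n = len(y)
    best, best_score = None, gini(y)
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2  # candidate threshold between neighbouring values
            left = [y[i] for i in range(n) if X[i][j] <= t]
            right = [y[i] for i in range(n) if X[i][j] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best_score - 1e-12:
                best, best_score = (j, t), score
    return best


def build_tree(X, y, depth=0, max_depth=3):
    """Recursively build a tree of nested dicts; leaves hold the majority label."""
    split = best_split(X, y) if depth < max_depth else None
    if split is None:
        return {"label": Counter(y).most_common(1)[0][0]}
    j, t = split
    left = [i for i in range(len(y)) if X[i][j] <= t]
    right = [i for i in range(len(y)) if X[i][j] > t]
    return {
        "feature": j,
        "threshold": t,
        "left": build_tree([X[i] for i in left], [y[i] for i in left], depth + 1, max_depth),
        "right": build_tree([X[i] for i in right], [y[i] for i in right], depth + 1, max_depth),
    }


def predict(tree, x):
    """Follow the threshold tests from the root down to a leaf."""
    while "label" not in tree:
        branch = "left" if x[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[branch]
    return tree["label"]


# Synthetic 4-dimensional "document embeddings" (an assumption for illustration):
# dimension 2 carries the class signal, the other dimensions are noise.
rng = random.Random(0)
labels = [i % 2 for i in range(40)]
docs = []
for label in labels:
    vec = [rng.uniform(-1.0, 1.0) for _ in range(4)]
    vec[2] = (1.0 if label == 1 else -1.0) + rng.uniform(-0.5, 0.5)
    docs.append(vec)

tree = build_tree(docs, labels)
print(f"root splits on embedding dimension {tree['feature']}")
```

In a real setting the rows of `docs` would come from a pretrained model, and raw embedding dimensions are not directly meaningful to a reader, which is presumably why the abstract mentions attribute selection, clustering, and word-cloud visualisation on top of the tree.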
Apr-21-2024
- Country:
  - Europe
    - Ireland > Leinster
      - County Dublin > Dublin (0.04)
    - Italy > Tuscany
      - Florence (0.04)
    - Switzerland > Geneva
      - Geneva (0.04)
  - North America
    - Dominican Republic (0.04)
    - United States
      - Indiana (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Michigan > Washtenaw County
        - Ann Arbor (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.04)
      - Oregon > Multnomah County
        - Portland (0.04)
- Genre:
  - Research Report (0.64)
- Industry:
  - Information Technology (0.48)
- Technology:
  - Information Technology > Artificial Intelligence
    - Machine Learning
      - Decision Tree Learning (0.73)
      - Neural Networks > Deep Learning (0.68)
      - Statistical Learning > Clustering (0.48)
    - Natural Language
      - Explanation & Argumentation (0.87)
      - Text Classification (0.87)
      - Text Processing (1.00)
    - Representation & Reasoning > Expert Systems (0.66)