Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks
Zhang, Yingji, Carvalho, Danilo S., Pratt-Hartmann, Ian, Freitas, André
arXiv.org Artificial Intelligence
Disentangling sentence representations over continuous spaces can be a critical process in improving interpretability and semantic control by localising explicit generative factors. Such a process confers on neural language models some of the advantages that are characteristic of symbolic models, while retaining their flexibility. This work presents a methodology for disentangling the hidden space of a BERT-GPT2 autoencoder by transforming it into a more separable semantic space with the support of a flow-based invertible neural network (INN). Experimental results indicate that the INN can transform the distributed hidden space into a better semantically disentangled latent space, resulting in better interpretability and controllability compared to recent state-of-the-art models.
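To make the idea of an invertible transformation of a hidden space concrete, the following is a minimal sketch of a single affine coupling layer, a common building block of flow-based INNs. It is not the architecture used in the paper, whose details are not given in this abstract. The function names and the fixed `_scale_and_shift` function are hypothetical stand-ins for a learned sub-network, and the four-dimensional vector is a toy substitute for an autoencoder hidden state.

```python
import math

def _scale_and_shift(x1):
    # Hypothetical stand-in for the learned sub-network of a coupling layer.
    # It only ever sees the untouched half, so it never needs to be inverted.
    s = [math.tanh(0.5 * v) for v in x1]  # bounded log-scale
    t = [0.1 * v + 1.0 for v in x1]       # shift
    return s, t

def _split(v):
    # This sketch assumes an even dimensionality so both halves align.
    if len(v) % 2 != 0:
        raise ValueError("vector length must be even")
    half = len(v) // 2
    return v[:half], v[half:]

def coupling_forward(x):
    """Map a hidden vector x to a latent vector z."""
    x1, x2 = _split(x)
    s, t = _scale_and_shift(x1)
    z2 = [b * math.exp(si) + ti for b, si, ti in zip(x2, s, t)]
    return x1 + z2  # first half passes through unchanged

def coupling_inverse(z):
    """Recover the hidden vector x from the latent vector z."""
    z1, z2 = _split(z)
    s, t = _scale_and_shift(z1)  # same s, t as in the forward pass
    x2 = [(b - ti) * math.exp(-si) for b, si, ti in zip(z2, s, t)]
    return z1 + x2

# Toy hidden vector; in the setting described above this would be an
# encoder output, which is an assumption of this sketch.
hidden = [0.3, -1.2, 2.0, 0.7]
latent = coupling_forward(hidden)
recovered = coupling_inverse(latent)
```

Because the second half is transformed using quantities computed only from the first half, the mapping can be undone up to floating-point error. This is the property that lets a vector be moved into the latent space, manipulated there, and mapped back for decoding.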
May-2-2023
- Country:
- North America > United States
- Illinois > Cook County > Chicago (0.04)
- Europe
- Switzerland (0.04)
- United Kingdom > England
- Greater Manchester > Manchester (0.04)
- Asia > Japan
- Kyūshū & Okinawa > Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- Genre:
- Research Report (0.84)
- Industry:
- Materials (0.46)