Distilled Wasserstein Learning for Word Embedding and Topic Modeling
Xu, Hongteng, Wang, Wenlin, Liu, Wei, Carin, Lawrence
–Neural Information Processing Systems
We propose a novel Wasserstein method with a distillation mechanism, yielding joint learning of word embeddings and topics. The proposed method is based on the fact that the Euclidean distance between word embeddings may be employed as the underlying distance in the Wasserstein topic model. The word distributions of topics, their optimal transport to the word distributions of documents, and the embeddings of words are learned in a unified framework. When learning the topic model, we leverage a distilled ground-distance matrix to update the topic distributions and smoothly calculate the corresponding optimal transports. Such a strategy provides the updating of word embeddings with robust guidance, improving algorithm convergence.
Neural Information Processing Systems
Feb-14-2020, 08:41:49 GMT
- Technology: