Distinguish Polarity in Bag-of-Words Visualization
Xie, Yusheng (Baidu Research) | Chen, Zhengzhang (Northwestern University) | Agrawal, Ankit (Northwestern University) | Choudhary, Alok (Northwestern University)
Neural network-based BOW models reveal that word-embedding vectors encode strong semantic regularities. However, such models are insensitive to word polarity. We show that, coupled with simple information such as word spellings, word-embedding vectors can preserve both semantic regularity and conceptual polarity without supervision. We then describe a nontrivial modification to the t-distributed stochastic neighbor embedding (t-SNE) algorithm that visualizes these semantic- and polarity-preserving vectors in reduced dimensions. On a real Facebook corpus, our experiments show significant improvement in t-SNE visualization as a result of the proposed modification.
- Country:
- South America > Chile (0.04)
- North America > United States
- Rhode Island > Providence County
- Providence (0.04)
- Illinois > Cook County
- Evanston (0.04)
- Florida > Hillsborough County
- Tampa (0.04)
- California > Santa Clara County
- Sunnyvale (0.04)
- Rhode Island > Providence County
- Asia > Middle East
- Jordan (0.04)
- Technology: