All-but-the-Top: Simple and Effective Postprocessing for Word Representations
Jiaqi Mu, Suma Bhat, Pramod Viswanath
Real-valued word representations have transformed NLP applications; popular examples are word2vec and GloVe, recognized for their ability to capture linguistic regularities. In this paper, we demonstrate a very simple, yet counter-intuitive, postprocessing technique -- eliminate the common mean vector and a few top dominating directions from the word vectors -- that renders off-the-shelf representations even stronger. The postprocessing is empirically validated on a variety of lexical-level intrinsic tasks (word similarity, concept categorization, word analogy) and sentence-level tasks (semantic textual similarity and text classification) on multiple datasets, with a variety of representation methods and hyperparameter choices, in multiple languages; in each case, the processed representations are consistently better than the original ones.
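A minimal NumPy sketch of the postprocessing the abstract describes: subtract the common mean vector, then project out the top principal components. The function name `all_but_the_top` and the default of roughly dim/100 removed components follow the paper's stated rule of thumb; the SVD-based PCA is one standard way to obtain the dominating directions, not necessarily the authors' exact implementation.

```python
import numpy as np

def all_but_the_top(X, n_components=None):
    """Postprocess a word-embedding matrix X of shape (vocab_size, dim).

    Removes the common mean vector and the projections onto the top
    principal components. n_components defaults to dim // 100, the
    rule of thumb suggested in the paper (assumption for this sketch).
    """
    if n_components is None:
        n_components = max(1, X.shape[1] // 100)

    # Step 1: subtract the common mean vector.
    mu = X.mean(axis=0)
    X_centered = X - mu

    # Step 2: top principal components of the centered embeddings;
    # rows of Vt are the dominating directions u_1, ..., u_D.
    _, _, Vt = np.linalg.svd(X_centered, full_matrices=False)
    U = Vt[:n_components]  # shape (n_components, dim)

    # Step 3: remove the projection onto those directions.
    return X_centered - X_centered @ U.T @ U
```

For 300-dimensional GloVe vectors this removes the mean and the top 3 directions; the returned matrix can be used as a drop-in replacement for the original embeddings.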