Efficient Non-parametric Estimation of Multiple Embeddings per Word in Vector Space
Arvind Neelakantan, Jeevan Shankar, Alexandre Passos, Andrew McCallum
There is rising interest in vector-space word embeddings and their use in NLP, especially given recent methods for their fast estimation at very large scale. Nearly all this work, however, assumes a single vector per word type, ignoring polysemy and thus jeopardizing their usefulness for downstream tasks. We present an extension to the Skip-gram model that efficiently learns multiple embeddings per word type. It differs from recent related work by jointly performing word sense discrimination and embedding learning, by non-parametrically estimating the number of senses per word type, and by its efficiency and scalability. We present new state-of-the-art results on the word similarity in context task and demonstrate the model's scalability by training on a corpus of nearly 1 billion tokens on a single machine in less than 6 hours.
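The non-parametric sense estimation the abstract describes can be sketched as online clustering of contexts: each occurrence's context vector is compared to the existing sense clusters of the word, and a new sense is created when none is similar enough. Below is a minimal, illustrative sketch of that discrimination step, assuming NumPy; the variable names, the incremental-mean cluster update, and the `SIM_THRESHOLD` value are assumptions for illustration, and the full model interleaves this step with Skip-gram gradient updates, which are omitted here.

```python
import numpy as np

# Minimum cosine similarity required to reuse an existing sense
# (the paper's lambda hyperparameter; the value here is illustrative).
SIM_THRESHOLD = 0.0

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8))

def assign_sense(context_vecs, centers, counts):
    """Pick the sense of a word whose cluster center best matches the
    current context, or create a new sense when none is similar enough.

    context_vecs: global embeddings of the words in the context window
    centers:      per-sense mean context vectors (mutated in place)
    counts:       number of contexts assigned to each sense (mutated in place)
    Returns the index of the chosen (possibly new) sense.
    """
    context = np.mean(context_vecs, axis=0)  # average context representation
    if centers:
        sims = [cosine(context, c) for c in centers]
        best = int(np.argmax(sims))
        if sims[best] >= SIM_THRESHOLD:
            # Incremental mean update of the winning cluster center.
            counts[best] += 1
            centers[best] += (context - centers[best]) / counts[best]
            return best
    # First occurrence, or no sufficiently similar sense: spawn a new one.
    centers.append(context.copy())
    counts.append(1)
    return len(centers) - 1
```

Because sense creation is driven by a similarity threshold rather than a fixed sense count per word, frequent polysemous words naturally accumulate more senses than monosemous ones, which is the key difference from parametric multi-sense models.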
Apr-24-2015