Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

Galassi, Andrea, Lippi, Marco, Torroni, Paolo

Feb-4-2019–arXiv.org Machine Learning

Attention is an increasingly popular mechanism used in a wide range of neural architectures. Because of the fast-paced advances in this domain, a systematic overview of attention is still missing. In this article, we define a unified model for attention architectures for natural language processing, with a focus on architectures designed to work with vector representation of the textual data. We discuss the dimensions along which proposals differ, the possible uses of attention, and chart the major research activities and open challenges in the area.

architecture, query, representation, (16 more...)

arXiv.org Machine Learning

Feb-4-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > New York County
    - New York City (0.14)
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
  - Florida > Broward County
    - Fort Lauderdale (0.04)
- Europe
  - Germany > Berlin (0.04)
  - Italy > Emilia-Romagna
    - Metropolitan City of Bologna > Bologna (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)
- Asia > Japan
  - Honshū > Kansai > Osaka Prefecture > Osaka (0.04)

Genre:
- Overview (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Text Processing (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found