Discontinuous Grammar as a Foreign Language
Fernández-González, Daniel, Gómez-Rodríguez, Carlos
–arXiv.org Artificial Intelligence
In order to achieve deep natural language understanding, syntactic constituent parsing is a vital step, highly demanded by many artificial intelligence systems to process both text and speech. One of the most recent proposals is the use of standard sequence-to-sequence models to perform constituent parsing as a machine translation task, instead of applying task-specific parsers. While they show a competitive performance, these text-to-parse transducers are still lagging behind classic techniques in terms of accuracy, coverage and speed. To close the gap, we here extend the framework of sequence-to-sequence models for constituent parsing, not only by providing a more powerful neural architecture for improving their performance, but also by enlarging their coverage to handle the most complex syntactic phenomena: discontinuous structures. To that end, we design several novel linearizations that can fully produce discontinuities and, for the first time, we test a sequence-to-sequence model on the main discontinuous benchmarks, obtaining competitive results on par with task-specific discontinuous constituent parsers and achieving state-of-the-art scores on the (discontinuous) English Penn Treebank.
arXiv.org Artificial Intelligence
Dec-22-2022
- Country:
- Asia
- China > Beijing
- Beijing (0.04)
- Japan > Hokkaidō
- Hokkaidō Prefecture > Sapporo (0.04)
- Middle East > Qatar
- Singapore (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- China > Beijing
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy > Tuscany
- Florence (0.04)
- Pisa Province > Pisa (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Belgium > Brussels-Capital Region
- North America
- Canada > British Columbia
- Dominican Republic (0.04)
- United States
- New York > New York County
- New York City (0.04)
- Washington > King County
- Seattle (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- California > San Diego County
- San Diego (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- Oceania > Australia
- Asia
- Genre:
- Research Report (1.00)
- Technology: