Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech
Pupier, Adrien, Coavoux, Maximin, Goulian, Jérôme, Lecouteux, Benjamin
–arXiv.org Artificial Intelligence
Direct dependency parsing of the speech signal -- as opposed to parsing speech transcriptions -- has recently been proposed as a task (Pupier et al. 2022), as a way of incorporating prosodic information in the parsing system and bypassing the limitations of a pipeline approach that would consist of using first an Automatic Speech Recognition (ASR) system and then a syntactic parser. In this article, we report on a set of experiments aiming at assessing the performance of two parsing paradigms (graph-based parsing and sequence labeling based parsing) on speech parsing. We perform this evaluation on a large treebank of spoken French, featuring realistic spontaneous conversations. Our findings show that (i) the graph based approach obtain better results across the board (ii) parsing directly from speech outperforms a pipeline approach, despite having 30% fewer parameters.
arXiv.org Artificial Intelligence
Jun-18-2024
- Country:
- South America > Chile
- North America
- United States
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Washington > King County
- Canada > Ontario
- Toronto (0.04)
- United States
- Europe
- Switzerland > Neuchâtel
- Neuchâtel (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.05)
- France
- Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
- Marseille (0.04)
- Hauts-de-France > Nord
- Lille (0.04)
- Auvergne-Rhône-Alpes > Isère
- Grenoble (0.05)
- Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
- Switzerland > Neuchâtel
- Genre:
- Research Report > New Finding (0.68)
- Technology: