Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives
Dinkar, Tanvi, Clavel, Chloé, Vasilescu, Ioana
–arXiv.org Artificial Intelligence
Disfluencies (i.e. interruptions in the regular flow of speech), are ubiquitous to spoken discourse. Fillers ("uh", "um") are disfluencies that occur the most frequently compared to other kinds of disfluencies. Yet, to the best of our knowledge, there isn't a resource that brings together the research perspectives influencing Spoken Language Understanding (SLU) on these speech events. This aim of this article is to survey a breadth of perspectives in a holistic way; i.e. from considering underlying (psycho)linguistic theory, to their annotation and consideration in Automatic Speech Recognition (ASR) and SLU systems, to lastly, their study from a generation standpoint. This article aims to present the perspectives in an approachable way to the SLU and Conversational AI community, and discuss moving forward, what we believe are the trends and challenges in each area.
arXiv.org Artificial Intelligence
Mar-24-2023
- Country:
- North America > United States
- Washington > King County
- Seattle (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.14)
- New York > New York County
- New York City (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California > San Mateo County
- Menlo Park (0.04)
- Washington > King County
- Europe
- United Kingdom
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- England > Cambridgeshire
- Cambridge (0.04)
- Scotland > City of Edinburgh
- Sweden > Stockholm
- Stockholm (0.04)
- Middle East > Malta
- Port Region > Southern Harbour District > Valletta (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- France
- United Kingdom
- Asia
- North America > United States
- Genre:
- Research Report (1.00)
- Overview (0.67)
- Industry:
- Health & Medicine (0.93)
- Media (0.67)
- Technology: