Sources of Noise in Dialogue and How to Deal with Them
arXiv.org Artificial Intelligence
Training dialogue systems often entails dealing with noisy training examples and unexpected user inputs. Despite the prevalence of such noise, there is currently no accurate survey of dialogue noise, nor is there a clear sense of the impact of each noise type on task performance. This paper addresses this gap by first constructing a taxonomy of noise encountered by dialogue systems. In addition, we run a series of experiments to show how different models behave when subjected to varying levels and types of noise. Our results reveal that models are quite robust to the label errors commonly tackled by existing denoising algorithms, but that performance suffers from dialogue-specific noise. Driven by these observations, we design a data cleaning algorithm specialized for conversational settings and apply it as a proof-of-concept for targeted dialogue denoising.
Jul-28-2023
- Country:
- Oceania > Australia (0.04)
- North America
- Dominican Republic (0.04)
- United States
- Nevada (0.04)
- New York > New York County
- New York City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Colorado > Denver County
- Denver (0.04)
- California
- San Francisco County > San Francisco (0.14)
- San Diego County > San Diego (0.04)
- Los Angeles County
- Long Beach (0.14)
- Los Angeles (0.04)
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- Europe
- Slovenia (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Italy
- Czechia
- South Moravian Region > Brno (0.04)
- Prague (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Germany > Saarland
- Saarbrücken (0.04)
- Asia
- Taiwan > Taiwan Province
- Taipei (0.04)
- China
- Hong Kong (0.04)
- Shandong Province > Qingdao (0.04)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Consumer Products & Services (0.92)
- Leisure & Entertainment > Sports (0.46)
- Technology: