A Survey of Available Corpora for Building Data-Driven Dialogue Systems
Serban, Iulian Vlad, Lowe, Ryan, Henderson, Peter, Charlin, Laurent, Pineau, Joelle
–arXiv.org Artificial Intelligence
During the past decade, several areas of speech and language understanding have witnessed substantial breakthroughs from the use of data-driven models. In the area of dialogue systems, the trend is less obvious, and most practical systems are still built through significant engineering and expert knowledge. Nevertheless, several recent results suggest that data-driven approaches are feasible and quite promising. To facilitate research in this area, we have carried out a wide survey of publicly available datasets suitable for data-driven learning of dialogue systems. We discuss important characteristics of these datasets, how they can be used to learn diverse dialogue strategies, and their other potential uses. We also examine methods for transfer learning between datasets and the use of external knowledge. Finally, we discuss appropriate choice of evaluation metrics for the learning objective.
arXiv.org Artificial Intelligence
Mar-20-2017
- Country:
- Europe > United Kingdom
- England (0.14)
- North America
- Canada > Quebec
- Montreal (0.14)
- United States > North Carolina (0.14)
- Canada > Quebec
- Europe > United Kingdom
- Industry:
- Consumer Products & Services > Travel (0.67)
- Education > Educational Setting (0.67)
- Health & Medicine (1.00)
- Information Technology (0.93)
- Leisure & Entertainment > Games
- Computer Games (0.67)
- Media
- Film (1.00)
- Television (1.00)
- Transportation (0.92)
- Technology:
- Information Technology
- Artificial Intelligence
- Cognitive Science (1.00)
- Machine Learning
- Learning Graphical Models
- Directed Networks > Bayesian Learning (1.00)
- Undirected Networks > Markov Models (1.00)
- Neural Networks > Deep Learning (0.93)
- Reinforcement Learning (0.93)
- Statistical Learning (1.00)
- Learning Graphical Models
- Natural Language
- Chatbot (1.00)
- Discourse & Dialogue (1.00)
- Information Retrieval (0.67)
- Machine Translation (0.93)
- Text Processing (1.00)
- Representation & Reasoning
- Agents (0.92)
- Personal Assistant Systems (0.67)
- Uncertainty > Bayesian Inference (0.92)
- Speech > Speech Recognition (1.00)
- Communications > Social Media (1.00)
- Artificial Intelligence
- Information Technology