A Survey of Available Corpora for Building Data-Driven Dialogue Systems

Serban, Iulian Vlad, Lowe, Ryan, Henderson, Peter, Charlin, Laurent, Pineau, Joelle

Mar-20-2017–arXiv.org Artificial Intelligence

During the past decade, several areas of speech and language understanding have witnessed substantial breakthroughs from the use of data-driven models. In the area of dialogue systems, the trend is less obvious, and most practical systems are still built through significant engineering and expert knowledge. Nevertheless, several recent results suggest that data-driven approaches are feasible and quite promising. To facilitate research in this area, we have carried out a wide survey of publicly available datasets suitable for data-driven learning of dialogue systems. We discuss important characteristics of these datasets, how they can be used to learn diverse dialogue strategies, and their other potential uses. We also examine methods for transfer learning between datasets and the use of external knowledge. Finally, we discuss appropriate choice of evaluation metrics for the learning objective.

information retrieval, machine learning, reinforcement learning, (25 more...)

arXiv.org Artificial Intelligence

Mar-20-2017

arXiv.org PDF

Add feedback

Country:
- North America > United States (1.00)
- Europe (1.00)

Genre:
- Overview (1.00)
- Research Report > New Finding (0.34)

Industry:
- Health & Medicine (1.00)
- Information Technology (0.93)
- Transportation (0.92)
- Consumer Products & Services > Travel (0.67)
- Education > Educational Setting (0.67)
- Media
  - Television (1.00)
  - Film (1.00)
- Leisure & Entertainment > Games
  - Computer Games (0.67)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence
    - Speech > Speech Recognition (1.00)
    - Cognitive Science (1.00)
    - Representation & Reasoning
      - Agents (0.92)
      - Uncertainty > Bayesian Inference (0.92)
      - Personal Assistant Systems (0.67)
    - Natural Language
      - Text Processing (1.00)
      - Discourse & Dialogue (1.00)
      - Chatbot (1.00)
      - Machine Translation (0.93)
      - Information Retrieval (0.67)
    - Machine Learning
      - Statistical Learning (1.00)
      - Neural Networks > Deep Learning (0.93)
      - Reinforcement Learning (0.93)
      - Learning Graphical Models
        Undirected Networks > Markov Models (1.00)
        Directed Networks > Bayesian Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found