Towards a Human-like Open-Domain Chatbot

Adiwardana, Daniel, Luong, Minh-Thang, So, David R., Hall, Jamie, Fiedel, Noah, Thoppilan, Romal, Yang, Zi, Kulshreshtha, Apoorv, Nemade, Gaurav, Lu, Yifeng, Le, Quoc V.

Jan-31-2020–arXiv.org Machine Learning

We present Meena, a multi-turn open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations. This 2.6B parameter neural network is simply trained to minimize perplexity of the next token. We also propose a human evaluation metric called Sensibleness and Specificity Average (SSA), which captures key elements of a human-like multi-turn conversation. Our experiments show strong correlation between perplexity and SSA. The fact that the best perplexity end-to-end trained Meena scores high on SSA (72% on multi-turn evaluation) suggests that a human-level SSA of 86% is potentially within reach if we can better optimize perplexity. Additionally, the full version of Meena (with a filtering mechanism and tuned decoding) scores 79% SSA, 23% higher in absolute SSA than the existing chatbots we evaluated.

meena, mitsuku, xiaoice, (16 more...)

arXiv.org Machine Learning

Jan-31-2020

arXiv.org PDF

Add feedback

Country:
- Oceania
  - Fiji (0.04)
  - Australia (0.04)
- North America > United States
  - Hawaii (0.04)
  - Wisconsin (0.04)
  - Arizona (0.04)
  - California > Santa Clara County
    - Palo Alto (0.04)
- Europe
  - Czechia > Prague (0.04)
  - Italy (0.04)
  - France (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
  - Hungary > Budapest
    - Budapest (0.04)
  - Belgium > Flanders
    - West Flanders > Bruges (0.04)
- Asia
  - Japan (0.04)
  - Vietnam (0.04)
  - Southeast Asia (0.04)
  - India (0.04)
  - China
    - Hong Kong (0.04)
    - Guangdong Province (0.04)

Genre:
- Personal > Interview (1.00)
- Research Report > New Finding (0.92)

Industry:
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
- Health & Medicine > Consumer Health (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.45)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found