Is the User Enjoying the Conversation? A Case Study on the Impact on the Reward Function

Jan-13-2021–arXiv.org Artificial Intelligence

The impact of user satisfaction in policy learning task-oriented dialogue systems has long been a subject of research interest. Most current models for estimating the user satisfaction either (i) treat out-of-context short-texts, such as product reviews, or (ii) rely on turn features instead of on distributed semantic representations. In this work we adopt deep neural networks that use distributed semantic representation learning for estimating the user satisfaction in conversations. We evaluate the impact of modelling context length in these networks. Moreover, we show that the proposed hierarchical network outperforms state-of-the-art quality estimators. Furthermore, we show that applying these networks to infer the reward function in a Partial Observable Markov Decision Process (POMDP) yields to a great improvement in the task success rate.

dialogue, dialogue system, representation, (14 more...)

arXiv.org Artificial Intelligence

Jan-13-2021

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > California
    - San Francisco County > San Francisco (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe
  - France (0.04)
  - Czechia > Prague (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Middle East > Republic of Türkiye
    - Istanbul Province > Istanbul (0.04)
- Asia > Middle East
  - Republic of Türkiye > Istanbul Province
    - Istanbul (0.04)
  - Qatar > Ad-Dawhah
    - Doha (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Discourse & Dialogue (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (1.00)
    - Statistical Learning (0.94)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found