Self-improving Chatbots based on Deep Reinforcement Learning

Nov-23-2020, 19:50:44 GMT–#artificialintelligence

We present a Reinforcement Learning (RL) model for self-improving chatbots, specifically targeting FAQ-type chatbots. The model is not aimed at building a dialog system from scratch, but at leveraging data from user conversations to improve chatbot performance. At the core of our approach is a score model, which is trained to score chatbot utterance-response tuples based on user feedback. The scores predicted by this model are used as rewards for the RL agent. Policy learning takes place offline, thanks to a user simulator which is fed with utterances from the FAQ database.
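The loop described above (a user simulator samples FAQ utterances, the agent picks a response, and a score model supplies the reward) can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the FAQ entries, the names `simulate_user`, `score_model`, `train_offline` and `respond`, and the hyperparameters are hypothetical. The score model here is a hard-coded stand-in rather than a model trained on user feedback, and a tabular epsilon-greedy value estimate is swapped in for the deep RL agent.

```python
import random

# Hypothetical FAQ database: utterance -> index of the matching response.
FAQ = {
    "how do i reset my password": 0,
    "what are your opening hours": 1,
    "how can i cancel my order": 2,
}
RESPONSES = [
    "Use the 'Forgot password' link.",
    "We are open 9am to 5pm.",
    "Go to Orders and choose Cancel.",
]


def simulate_user(rng):
    """User simulator: sample one utterance from the FAQ database."""
    return rng.choice(sorted(FAQ))


def score_model(utterance, response_id):
    """Stand-in score model (assumption): 1.0 for the matching response,
    0.0 otherwise. The source describes a model trained on user feedback."""
    return 1.0 if FAQ[utterance] == response_id else 0.0


def train_offline(episodes=2000, epsilon=0.2, lr=0.1, seed=0):
    """Offline policy learning with a tabular value estimate per
    (utterance, response) pair; a simplification, not a deep network."""
    rng = random.Random(seed)
    q = {u: [0.0] * len(RESPONSES) for u in FAQ}
    for _ in range(episodes):
        utterance = simulate_user(rng)
        if rng.random() < epsilon:
            # Explore: try a random response.
            action = rng.randrange(len(RESPONSES))
        else:
            # Exploit: pick the response with the highest estimated score.
            action = max(range(len(RESPONSES)), key=lambda a: q[utterance][a])
        reward = score_model(utterance, action)
        # Move the estimate toward the observed reward.
        q[utterance][action] += lr * (reward - q[utterance][action])
    return q


def respond(q, utterance):
    """Greedy policy: return the highest-valued response for an utterance."""
    best = max(range(len(RESPONSES)), key=lambda a: q[utterance][a])
    return RESPONSES[best]
```

Calling `q = train_offline()` and then `respond(q, "what are your opening hours")` exercises the whole loop without any live user traffic, which is the point of learning the policy offline against a simulator.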

chatbot, score model, utterance-response tuple


