Linguistic calibration through metacognition: aligning dialogue agent responses with expected correctness

Mielke, Sabrina J., Szlam, Arthur, Boureau, Y-Lan, Dinan, Emily

arXiv.org Artificial Intelligence 

Open-domain dialogue agents have vastly improved, but still confidently hallucinate knowledge or express doubt when asked straightforward questions. In this work, we analyze whether state-of-the-art chit-chat models can express metacognition capabilities through their responses: does a verbalized expression of doubt (or confidence) match the likelihood that the model's answer is incorrect (or correct)? We find that these models are poorly calibrated in this sense, yet we show that the representations within the models can be used to accurately predict likelihood of correctness. By incorporating these correctness predictions into the training of a controllable generation model, we obtain a dialogue agent with greatly improved linguistic calibration.
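The abstract describes a two-stage idea: probe the model's internal representations to predict whether its answer will be correct, then feed that prediction into a controllable generator so the verbalized confidence matches it. Below is a minimal sketch of that pipeline, not the authors' code: the `Calibrator` module, `HIDDEN_DIM`, the bucketing thresholds, and the `<low-conf>`/`<maybe>`/`<high-conf>` control tokens are illustrative assumptions, not identifiers from the paper.

```python
# Minimal sketch of linguistic calibration via a correctness probe plus
# confidence control tokens. All names and thresholds are illustrative.
import torch
import torch.nn as nn

HIDDEN_DIM = 512  # assumed size of the dialogue model's pooled hidden state


class Calibrator(nn.Module):
    """Linear probe: pooled model representation -> P(answer is correct)."""

    def __init__(self, hidden_dim: int):
        super().__init__()
        self.probe = nn.Linear(hidden_dim, 1)

    def forward(self, pooled_state: torch.Tensor) -> torch.Tensor:
        return torch.sigmoid(self.probe(pooled_state)).squeeze(-1)


def confidence_token(p_correct: float) -> str:
    """Bucket predicted correctness into a control token for generation."""
    if p_correct < 0.3:
        return "<low-conf>"
    if p_correct < 0.7:
        return "<maybe>"
    return "<high-conf>"


if __name__ == "__main__":
    calibrator = Calibrator(HIDDEN_DIM)
    # In practice this would be the dialogue model's representation of the
    # question (and candidate answer); a random tensor stands in here.
    pooled = torch.randn(1, HIDDEN_DIM)
    p = calibrator(pooled).item()

    context = "Q: Who wrote 'The Old Man and the Sea'?"
    controlled_input = f"{confidence_token(p)} {context}"
    print(controlled_input)  # e.g. "<maybe> Q: Who wrote ..."
```

In a setup like this, the controllable generator would be trained on contexts prefixed with tokens derived from gold correctness labels, and at inference time the calibrator's prediction supplies the token, so expressed doubt or confidence tracks the predicted likelihood of being right.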
