Linguistic calibration through metacognition: aligning dialogue agent responses with expected correctness
Mielke, Sabrina J., Szlam, Arthur, Boureau, Y-Lan, Dinan, Emily
–arXiv.org Artificial Intelligence
Open-domain dialogue agents have vastly improved, but still confidently hallucinate knowledge or express doubt when asked straightforward questions. In this work, we analyze whether state-of-the-art chit-chat models can express metacognition capabilities through their responses: does a verbalized expression of doubt (or confidence) match the likelihood that the model's answer is incorrect (or correct)? We find that these models are poorly calibrated in this sense, yet we show that the representations within the models can be used to accurately predict likelihood of correctness. By incorporating these correctness predictions into the training of a controllable generation model, we obtain a dialogue agent with greatly improved linguistic calibration.
arXiv.org Artificial Intelligence
Dec-29-2020
- Country:
- South America > Brazil (0.04)
- Oceania > Australia (0.04)
- Atlantic Ocean > Gulf of Mexico (0.04)
- Africa (0.04)
- North America
- Mexico (0.14)
- United States
- Pennsylvania (0.04)
- Washington > King County
- Seattle (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Florida > Duval County
- Jacksonville (0.04)
- California > Los Angeles County
- Los Angeles (0.04)
- Canada > British Columbia
- Europe
- Greece (0.04)
- France (0.04)
- Italy (0.04)
- North Macedonia (0.04)
- Austria > Styria
- Graz (0.04)
- Germany
- Lower Saxony (0.04)
- North Rhine-Westphalia > Cologne Region
- Bonn (0.04)
- United Kingdom > England
- West Midlands > Birmingham (0.04)
- Ukraine > Kyiv Oblast
- Chernobyl (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia > Middle East
- Qatar (0.04)
- Genre:
- Research Report (0.64)
- Industry:
- Health & Medicine (0.94)
- Media > Film (0.68)
- Leisure & Entertainment > Games (0.68)
- Materials > Metals & Mining
- Steel (1.00)
- Government > Regional Government
- Technology: