Do language models have coherent mental models of everyday things?

Gu, Yuling, Mishra, Bhavana Dalvi, Clark, Peter

Jun-8-2023–arXiv.org Artificial Intelligence

When people think of everyday things like an egg, they typically have a mental image associated with it. This allows them to correctly judge, for example, that "the yolk surrounds the shell" is a false statement. Do language models similarly have a coherent picture of such everyday things? To investigate this, we propose a benchmark dataset consisting of 100 everyday things, their parts, and the relationships between these parts, expressed as 11,720 "X relation Y?" true/false questions. Using these questions as probes, we observe that state-ofthe-art pre-trained language models (LMs) like GPT-3 and Macaw have fragments of knowledge about these everyday things, but do not have fully coherent "parts mental models" (54-59% accurate, 19-43% conditional constraint violation). We propose an extension where we add a constraint satisfaction layer on top of the LM's raw predictions to apply commonsense constraints. As well as removing inconsistencies, we find that this also significantly improves accuracy (by 16-20%), suggesting how the incoherence of the LM's pictures of Figure 1: While humans appear to have coherent mental everyday things can be significantly reduced.

artificial intelligence, mental model, natural language, (18 more...)

arXiv.org Artificial Intelligence

Jun-8-2023

arXiv.org PDF

Add feedback

Country:
- Europe (0.67)
- North America > United States
  - Minnesota (0.28)

Genre:
- Research Report (0.82)

Industry:
- Transportation > Air (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Representation & Reasoning > Constraint-Based Reasoning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found