Inductive reasoning in humans and large language models

Han, Simon J., Ransom, Keith, Perfors, Andrew, Kemp, Charles

Aug-3-2023–arXiv.org Artificial Intelligence

This work was supported in part by the Complex Human Data Hub at the University of Melbourne and by ARC FT190100200. Correspondence concerning this article should be addressed to Jerome Han. Abstract The impressive recent performance of large language models has led many to wonder to what extent they can serve as models of general intelligence or are similar to human cognition. We address this issue by applying GPT-3.5 and GPT-4 to a classic problem in human inductive reasoning known as property induction. Although GPT-3.5 struggles to capture many aspects of human behaviour, GPT-4 is much more successful: for the most part, its performance qualitatively matches that of humans, and the only notable exception is its failure to capture the phenomenon of premise non-monotonicity. Our work demonstrates that property induction allows for interesting comparisons between human and machine intelligence and provides two large datasets that can serve as benchmarks for future work in this vein.

argument, gpt-3, reasoning, (14 more...)

arXiv.org Artificial Intelligence

Aug-3-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Texas (0.04)
  - New York (0.04)
  - California (0.04)
  - Arkansas (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
    - Oxfordshire > Oxford (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Belgium > Flanders
    - Flemish Brabant > Leuven (0.04)
- Asia > Middle East
  - Republic of Türkiye (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (0.92)

Industry:
- Transportation > Passenger (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found