Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies

Aher, Gati, Arriaga, Rosa I., Kalai, Adam Tauman

Jul-9-2023–arXiv.org Artificial Intelligence

We introduce a new type of test, called a Turing Experiment (TE), for evaluating to what extent a given language model, such as GPT models, can simulate different aspects of human behavior. A TE can also reveal consistent distortions in a language model's simulation of a specific human behavior. Unlike the Turing Test, which involves simulating a single arbitrary individual, a TE requires simulating a representative sample of participants in human subject research. We carry out TEs that attempt to replicate well-established findings from prior studies. We design a methodology for simulating TEs and illustrate its use to compare how well different language models are able to reproduce classic economic, psycholinguistic, and social psychology experiments: Ultimatum Game, Garden Path Sentences, Milgram Shock Experiment, and Wisdom of Crowds. In the first three TEs, the existing findings were replicated using recent models, while the last TE reveals a "hyper-accuracy distortion" present in some language models (including ChatGPT and GPT-4), which could affect downstream applications in education and the arts.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Jul-9-2023

arXiv.org PDF

Add feedback

Country:
- South America > Uruguay
  - Maldonado > Maldonado (0.04)
- North America > United States
  - Alaska (0.04)
  - California > San Diego County
    - San Diego (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Law (0.46)
- Health & Medicine (0.46)
- Leisure & Entertainment (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found