How Well Do LLMs Predict Human Behavior? A Measure of Their Pretrained Knowledge
Gao, Wayne; Han, Sukjin; Liang, Annie
Large language models (LLMs) are increasingly used in economics as predictive tools, both to generate synthetic responses in place of human subjects (Horton, 2023; Anthis et al., 2025) and to forecast economic outcomes directly (Hewitt et al., 2024a; Faria-e-Castro and Leibovici, 2024; Chan-Lau et al., 2025). Their appeal in these roles is obvious: a pretrained LLM embeds a vast amount of information and can be deployed at negligible cost, often in settings where collecting new, domain-specific human data would be expensive or infeasible. What remains unclear is how to assess the quality of these predictions. This paper proposes a measure that quantifies the domain-specific value of LLMs in an interpretable unit: the amount of human data they substitute for. Specifically, we ask how much human data a conventional model would need to be trained on to match the predictive performance of the pretrained LLM in that domain.
Jan-21-2026
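To make the proposed unit concrete, the following is a minimal sketch of how such a data-equivalence measure could be computed. Everything in it is an illustrative assumption rather than the paper's implementation: the synthetic binary-choice data, the logistic regression standing in for the "conventional model," and the fixed value llm_loss standing in for the LLM's scored predictive loss on the same test set.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import log_loss

rng = np.random.default_rng(0)

# Synthetic stand-in for a domain of human behavioral data:
# binary choices driven by two observable covariates.
n_pool, n_test = 5000, 1000
X = rng.normal(size=(n_pool + n_test, 2))
prob = 1.0 / (1.0 + np.exp(-(0.8 * X[:, 0] - 0.5 * X[:, 1])))
y = rng.binomial(1, prob)
X_pool, y_pool = X[:n_pool], y[:n_pool]
X_test, y_test = X[n_pool:], y[n_pool:]

# Assumed cross-entropy of the pretrained LLM's predictions on the
# test set; in practice this would come from scoring the LLM itself.
llm_loss = 0.64

def trained_model_loss(n):
    """Cross-entropy of a conventional model fit to n human samples."""
    clf = LogisticRegression().fit(X_pool[:n], y_pool[:n])
    return log_loss(y_test, clf.predict_proba(X_test))

# Data-equivalence measure: the smallest training-set size at which
# the conventional model matches the LLM's predictive performance.
sizes = np.unique(np.geomspace(50, n_pool, 25).astype(int))
losses = np.array([trained_model_loss(n) for n in sizes])
matched = sizes[losses <= llm_loss]
print("LLM-equivalent sample size:",
      int(matched[0]) if matched.size else f"> {n_pool}")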