Systematic Evaluation of GPT-3 for Zero-Shot Personality Estimation

Ganesan, Adithya V, Lal, Yash Kumar, Nilsson, August Håkan, Schwartz, H. Andrew

Jun-1-2023–arXiv.org Artificial Intelligence

Very large language models (LLMs) perform extremely well on a spectrum of NLP tasks in a zero-shot setting. However, little is known about their performance on human-level NLP problems which rely on understanding psychological concepts, such as assessing personality traits. In this work, we investigate the zero-shot ability of GPT-3 to estimate the Big 5 personality traits from users' social media posts. Through a set of systematic experiments, we find that zero-shot GPT-3 performance is somewhat close to an existing pre-trained SotA for broad classification upon injecting knowledge about the trait in the prompts. However, when prompted to provide fine-grained classification, its performance drops to close to a simple most frequent class (MFC) baseline. We further analyze where GPT-3 performs better, as well as worse, than a pretrained lexical model, illustrating systematic errors that suggest ways to improve LLMs on human-level NLP tasks.

gpt-3, large language model, machine learning, (21 more...)

arXiv.org Artificial Intelligence

Jun-1-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - Dominican Republic (0.04)
  - United States
    - Maryland (0.04)
    - Washington > King County
      - Seattle (0.04)
    - New York > Suffolk County
      - Stony Brook (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
- Europe
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
  - Norway > Eastern Norway
    - Oslo (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
- Asia > Middle East
  - UAE > Abu Dhabi Emirate
    - Abu Dhabi (0.04)
  - Qatar > Ad-Dawhah
    - Doha (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found