Open-Source Large Language Models Outperform Crowd Workers and Approach ChatGPT in Text-Annotation Tasks

Alizadeh, Meysam, Kubli, Maël, Samei, Zeynab, Dehghani, Shirin, Bermeo, Juan Diego, Korobeynikova, Maria, Gilardi, Fabrizio

Jul-5-2023–arXiv.org Artificial Intelligence

For instance, studies show that ChatGPT outperforms crowd workers on relevance, stance, sentiment, topic identification, and frame detection tasks (Gilardi, Alizadeh and Kubli, 2023), that it outperforms trained annotators in detecting the political party affiliations of Twitter users (Törnberg, 2023), and that it achieves accuracy scores above 0.6 on tasks such as stance, sentiment, hate speech detection, and bot identification (Zhu et al., 2023). Notably, ChatGPT also correctly classifies more than 70% of news items as either true or false (Hoes, Altay and Bermeo, 2023), which suggests that LLMs could assist content moderation processes.

While the performance of LLMs for text annotation is promising, several aspects remain unclear and require further research. Among these is the impact of different approaches, such as zero-shot versus few-shot learning, and of settings such as the temperature parameter. Zero-shot learning asks a model to perform a task without any labeled examples, while few-shot learning supplies a small number of examples to help it generalize to the new task. The conditions under which one approach outperforms the other are not yet fully understood.
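The difference between zero-shot and few-shot annotation, and where a temperature setting enters, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names `build_prompt`, `make_request`, and `accuracy`, the prompt layout, and the request dictionary are hypothetical and do not reproduce the prompts, models, or evaluation code used in the paper.

```python
from typing import Dict, Optional, Sequence, Tuple, Union


def build_prompt(
    instruction: str,
    text: str,
    examples: Optional[Sequence[Tuple[str, str]]] = None,
) -> str:
    """Build an annotation prompt (hypothetical layout).

    With no examples the prompt is zero-shot; with a few labeled
    (text, label) pairs it becomes few-shot.
    """
    parts = [instruction.strip()]
    for example_text, label in examples or []:
        parts.append(f"Text: {example_text}\nLabel: {label}")
    # The item to annotate always comes last, with the label left open.
    parts.append(f"Text: {text}\nLabel:")
    return "\n\n".join(parts)


def make_request(prompt: str, temperature: float) -> Dict[str, Union[str, float]]:
    """Bundle a prompt with a sampling temperature.

    This is a plain dictionary, not the request format of any real API.
    Lower temperatures make sampling more deterministic.
    """
    if temperature < 0:
        raise ValueError("temperature must be non-negative")
    return {"prompt": prompt, "temperature": temperature}


def accuracy(predicted: Sequence[str], gold: Sequence[str]) -> float:
    """Share of predicted labels that match the gold labels."""
    if len(predicted) != len(gold) or not gold:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(p == g for p, g in zip(predicted, gold)) / len(gold)


instruction = "Classify the stance of the text as 'favor' or 'against'."
zero_shot = build_prompt(instruction, "I support the new policy.")
few_shot = build_prompt(
    instruction,
    "I support the new policy.",
    examples=[("This law is a disaster.", "against")],
)
request = make_request(few_shot, temperature=0.2)
```

Comparing the two approaches then amounts to sending both prompt variants for the same items at a given temperature and scoring each set of returned labels with `accuracy` against the reference annotations.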
