Open-Source Large Language Models Outperform Crowd Workers and Approach ChatGPT in Text-Annotation Tasks
Alizadeh, Meysam, Kubli, Maël, Samei, Zeynab, Dehghani, Shirin, Bermeo, Juan Diego, Korobeynikova, Maria, Gilardi, Fabrizio
–arXiv.org Artificial Intelligence
For instance, studies demonstrate that ChatGPT exceeds the performance of crowd-workers in tasks encompassing relevance, stance, sentiment, topic identification, and frame detection (Gilardi, Alizadeh and Kubli, 2023), that it outperforms trained annotators in detecting the political party affiliations of Twitter users (Törnberg, 2023), and that it achieves accuracy scores over 0.6 for tasks such as stance, sentiment, hate speech detection, and bot identification (Zhu et al., 2023). Notably, ChatGPT also demonstrates the ability to correctly classify more than 70% of news as either true or false (Hoes, Altay and Bermeo, 2023), which suggests that LLMs might potentially be used to assist content moderation processes. While the performance of LLMs for text annotation is promising, there are several aspects that remain unclear and require further research. Among these is the impact of different approaches such as zero-shot versus few-shot learning and settings such as varying temperature parameters. Zero-shot learning allows models to predict for unseen tasks, while few-shot learning uses a small number of examples to generalize to new tasks. The conditions under which one approach outperforms the other are not fully understood yet.
arXiv.org Artificial Intelligence
Jul-5-2023
- Country:
- Africa
- Kenya > Nairobi City County
- Nairobi (0.04)
- Middle East > Somalia (0.04)
- Kenya > Nairobi City County
- Asia
- China (0.04)
- India > NCT
- New Delhi (0.04)
- Middle East > Iran
- Tehran Province > Tehran (0.04)
- Russia (0.46)
- Sri Lanka (0.04)
- Europe
- Greece (0.04)
- North Macedonia (0.04)
- Poland > Pomerania Province
- Gdańsk (0.04)
- Russia (0.14)
- Sweden (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- North America
- Canada > Ontario
- Toronto (0.04)
- Mexico (0.04)
- United States
- California
- Alameda County > Oakland (0.04)
- Los Angeles County > Santa Monica (0.04)
- District of Columbia > Washington (0.04)
- Florida > Leon County
- Tallahassee (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Wisconsin (0.04)
- California
- Canada > Ontario
- South America > Venezuela (0.04)
- Africa
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Banking & Finance (1.00)
- Government
- Military (1.00)
- Regional Government > North America Government
- United States Government (1.00)
- Voting & Elections (1.00)
- Health & Medicine > Therapeutic Area (0.94)
- Information Technology
- Security & Privacy (1.00)
- Services (1.00)
- Law > Statutes (0.94)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Media > News (0.93)
- Technology: