Are LLMs Enough for Hyperpartisan, Fake, Polarized and Harmful Content Detection? Evaluating In-Context Learning vs. Fine-Tuning
Maggini, Michele Joshua, Merzougui, Dhia, Bandyopadhyay, Rabiraj, Dias, Gaël, Maurel, Fabrice, Gamallo, Pablo
–arXiv.org Artificial Intelligence
The spread of fake news, polarizing, politically biased, and harmful content on online platforms has been a serious concern. With large language models becoming a promising approach, however, no study has properly benchmarked their performance across different models, usage methods, and languages. This study presents a comprehensive overview of different Large Language Models adaptation paradigms for the detection of hyperpartisan and fake news, harmful tweets, and political bias. Our experiments spanned 10 datasets and 5 different languages (English, Spanish, Portuguese, Arabic and Bulgarian), covering both binary and multiclass classification scenarios. We tested different strategies ranging from parameter efficient Fine-Tuning of language models to a variety of different In-Context Learning strategies and prompts. These included zero-shot prompts, codebooks, few-shot (with both randomly-selected and diversely-selected examples using Determinantal Point Processes), and Chain-of-Thought. We discovered that In-Context Learning often underperforms when compared to Fine-Tuning a model. This main finding highlights the importance of Fine-Tuning even smaller models on task-specific settings even when compared to the largest models evaluated in an In-Context Learning setup - in our case LlaMA3.1-8b-Instruct,
arXiv.org Artificial Intelligence
Sep-10-2025
- Country:
- Asia (1.00)
- Europe (0.92)
- North America
- Mexico (0.28)
- United States (0.27)
- Genre:
- Research Report > New Finding (1.00)
- Overview (1.00)
- Industry:
- Media > News (1.00)
- Law (1.00)
- Health & Medicine (1.00)
- Government (1.00)
- Information Technology (0.92)
- Technology: