Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate
Ngueajio, Mikel K., Plaza-del-Arco, Flor Miriam, Chung, Yi-Ling, Rawat, Danda B., Curry, Amanda Cercas
–arXiv.org Artificial Intelligence
Automated counter-narratives (CN) offer a promising strategy for mitigating online hate speech, yet concerns about their affective tone, accessibility, and ethical risks remain. We propose a framework for evaluating Large Language Model (LLM)-generated CNs across four dimensions: persona framing, verbosity and readability, affective tone, and ethical robustness. Using GPT-4o-Mini, Cohere's CommandR-7B, and Meta's LLaMA 3.1-70B, we assess three prompting strategies on the MT-Conan and HatEval datasets. Our findings reveal that LLM-generated CNs are often verbose and adapted for people with college-level literacy, limiting their accessibility. While emotionally guided prompts yield more empathetic and readable responses, there remain concerns surrounding safety and effectiveness.
arXiv.org Artificial Intelligence
Jun-5-2025
- Country:
- Africa > Rwanda (0.04)
- Asia
- India > Gujarat
- Gandhinagar (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Fukuoka Prefecture > Fukuoka (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.14)
- India > Gujarat
- Europe
- Bulgaria (0.04)
- Czechia > Prague (0.04)
- Italy (0.04)
- Netherlands > South Holland
- Leiden (0.04)
- Spain (0.04)
- North America
- Canada > Ontario
- Toronto (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States > Florida
- Miami-Dade County > Miami (0.04)
- Canada > Ontario
- Genre:
- Research Report > New Finding (0.66)
- Industry:
- Government
- Immigration & Customs (0.93)
- Regional Government (1.00)
- Health & Medicine (1.00)
- Information Technology (0.68)
- Law (0.67)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Government
- Technology: