Assessing the Human Likeness of AI-Generated Counterspeech

Song, Xiaoying, Mamidisetty, Sujana, Blanco, Eduardo, Hong, Lingzi

Dec-15-2024–arXiv.org Artificial Intelligence

Counterspeech is a targeted response to counteract and challenge abusive or hateful content. It effectively curbs the spread of hatred and fosters constructive online communication. Previous studies have proposed different strategies for automatically generated counterspeech. Evaluations, however, focus on relevance, surface form, and other shallow linguistic characteristics. This paper investigates the human likeness of AI-generated counterspeech, a critical factor influencing effectiveness. We implement and evaluate several LLM-based generation strategies, and discover that AI-generated and human-written counterspeech can be easily distinguished by both simple classifiers and humans. Further, we reveal differences in linguistic characteristics, politeness, and specificity. The dataset used in this study is publicly available for further research.

counterspeech, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

Dec-15-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Texas (0.14)
  - Arizona (0.04)
- Asia
  - India (0.04)
  - Indonesia > Java
    - East Java > Surabaya (0.04)

Genre:
- Research Report
  - New Finding (0.88)
  - Experimental Study (0.68)

Industry:
- Information Technology (0.46)

Technology:
- Information Technology
  - Communications > Social Media (0.99)
  - Artificial Intelligence
    - Natural Language
      - Large Language Model (1.00)
      - Chatbot (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found