Studying Socially Unacceptable Discourse Classification (SUD) through different eyes: "Are we on the same page ?"

Carneiro, Bruno Machado; Linardi, Michele; Longhi, Julien

Aug-8-2023–arXiv.org Artificial Intelligence

We study the characterization and detection of Socially Unacceptable Discourse (SUD) in online text. We first build and present a novel corpus that gathers a large variety of manually annotated texts from the different online sources used so far in state-of-the-art machine learning (ML) SUD detection solutions. This global context allows us to test the generalization ability of SUD classifiers that acquire knowledge about the same SUD categories, but from different contexts. From this perspective, we analyze how (possibly) different annotation modalities influence SUD learning, and we discuss open challenges and research directions. We also provide several data insights that can support domain experts in the annotation task.

artificial intelligence, machine learning, natural language

Country:
- North America > United States (0.28)
- Europe
  - Ukraine (0.04)
  - Germany (0.04)

Genre:
- Research Report (1.00)

Industry:
- Government (0.94)
- Media (0.69)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence
    - Natural Language (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)
