Contextual-Lexicon Approach for Abusive Language Detection

Vargas, Francielle, de Góes, Fabiana Rodrigues, Carvalho, Isabelle, Benevenuto, Fabrício, Pardo, Thiago Alexandre Salgueiro

Dec-20-2022–arXiv.org Artificial Intelligence

Because lexicon-based approaches are more scientifically elegant, make the components of a solution explainable, and generalize more easily to other applications, this paper proposes a new approach for detecting offensive language and hate speech on social media. Our approach embodies a lexicon of implicit and explicit offensive and swearing expressions annotated with contextual information. Due to the severity of abusive comments on social media in Brazil and the lack of research on Portuguese, Brazilian Portuguese is the language used to validate the models. Nevertheless, our method may be applied to any other language. The experiments show the effectiveness of the proposed approach, which outperforms the current baseline methods for the Portuguese language.
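As a rough illustration of what a lexicon annotated with contextual information could look like in practice, the sketch below counts lexicon hits in a comment, split by a context label. It is not the authors' implementation: the `LexiconEntry` type, the binary `context_dependent` label, the English placeholder terms, and the `lexicon_features` function are all assumptions made for this example, and the abstract does not specify the annotation scheme or how the lexicon feeds the models.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class LexiconEntry:
    """One lexicon term plus a hypothetical context annotation."""
    term: str
    # Assumption: True means the term is offensive only in some contexts,
    # False means it is treated as offensive regardless of context.
    context_dependent: bool


# Toy lexicon with placeholder English terms (assumed, not from the paper).
LEXICON = [
    LexiconEntry("idiot", context_dependent=False),
    LexiconEntry("trash", context_dependent=True),
]


def lexicon_features(text, lexicon=LEXICON):
    """Count lexicon matches in `text`, grouped by context label."""
    # Lowercase and split into word tokens; multi-word expressions
    # are not handled in this sketch.
    tokens = re.findall(r"\w+", text.lower())
    counts = {"context_independent": 0, "context_dependent": 0}
    for entry in lexicon:
        key = "context_dependent" if entry.context_dependent else "context_independent"
        counts[key] += tokens.count(entry.term)
    return counts


if __name__ == "__main__":
    print(lexicon_features("You are an idiot, total trash"))
    print(lexicon_features("Please take out the trash"))
```

Counts like these could serve as interpretable input features for a downstream classifier, which is one way a lexicon-based method can expose which expressions drove a prediction. Adapting the sketch to another language would mainly mean swapping the lexicon entries.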

detection, machine learning, natural language

Country:
- South America > Brazil
  - São Paulo (0.05)
  - Rio Grande do Sul > Porto Alegre (0.04)
  - Minas Gerais (0.04)
- Oceania > New Zealand
  - North Island > Waikato (0.04)
- North America
  - United States
    - Minnesota (0.04)
    - Massachusetts (0.04)
    - Wisconsin > Dane County
      - Madison (0.04)
    - Illinois > Cook County
      - Chicago (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - Spain > Valencian Community
    - Valencia Province > Valencia (0.04)
  - Italy
    - Tuscany > Florence (0.04)
    - Veneto > Venice (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
  - Bulgaria > Varna Province
    - Varna (0.04)
- Asia > Japan
  - Honshū > Kansai > Osaka Prefecture > Osaka (0.04)

Genre:
- Research Report (0.82)

Industry:
- Government (0.93)
- Information Technology (0.68)
- Law Enforcement & Public Safety > Terrorism (0.47)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Data Science (0.93)
  - Artificial Intelligence
    - Natural Language (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.32)
