Improving Hate Speech Classification with Cross-Taxonomy Dataset Integration
–arXiv.org Artificial Intelligence
Algorithmic hate speech detection faces significant challenges due to the diverse definitions and datasets used in research and practice. Social media platforms, legal frameworks, and institutions each apply distinct yet overlapping definitions, complicating classification efforts. This study addresses these challenges by demonstrating that existing datasets and taxonomies can be integrated into a unified model, enhancing prediction performance and reducing reliance on multiple specialized classifiers. The work introduces a universal taxonomy and a hate speech classifier capable of detecting a wide range of definitions within a single framework. Our approach is validated by combining two widely used but differently annotated datasets, showing improved classification performance on an independent test set. This work highlights the potential of dataset and taxonomy integration in advancing hate speech detection, increasing efficiency, and ensuring broader applicability across contexts.
arXiv.org Artificial Intelligence
Mar-7-2025
- Country:
- Africa (0.04)
- North America > United States
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New Mexico > Santa Fe County
- Europe
- Austria > Vienna (0.14)
- Eastern Europe (0.04)
- Poland > Łódź Province
- Łódź (0.04)
- Italy > Tuscany
- Florence (0.04)
- Germany
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Pakistan (0.04)
- China (0.04)
- South Korea > Incheon
- Incheon (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Genre:
- Research Report (0.64)
- Industry:
- Law (1.00)
- Information Technology (0.94)
- Technology:
- Information Technology
- Data Science (1.00)
- Communications > Social Media (1.00)
- Artificial Intelligence
- Natural Language (1.00)
- Machine Learning (1.00)
- Representation & Reasoning > Ontologies (0.69)
- Information Technology