Directions in Abusive Language Training Data: Garbage In, Garbage Out
Vidgen, Bertie, Derczynski, Leon
–arXiv.org Artificial Intelligence
Data-driven analysis and detection of abusive online content covers many different tasks, phenomena, contexts, and methodologies. This paper systematically reviews abusive language dataset creation and content in conjunction with an open website for cataloguing abusive language data. This collection of knowledge leads to a synthesis providing evidence-based recommendations for practitioners working with this complex and highly diverse data.
arXiv.org Artificial Intelligence
Jul-19-2021
- Country:
- Asia > India (0.04)
- Europe
- Croatia (0.04)
- France (0.04)
- Slovenia (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Oxfordshire > Oxford (0.14)
- Spain > Andalusia
- Seville Province > Seville (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Germany (0.04)
- Italy > Tuscany
- Florence (0.04)
- Bulgaria > Varna Province
- Varna (0.04)
- North America
- Canada > British Columbia
- United States
- Hawaii (0.04)
- New Jersey > Mercer County
- Princeton (0.04)
- New York > New York County
- New York City (0.04)
- South America > Venezuela (0.04)
- Genre:
- Overview (1.00)
- Research Report > New Finding (0.67)
- Industry:
- Education > Educational Setting
- Higher Education (0.45)
- Government (1.00)
- Health & Medicine (1.00)
- Information Technology
- Security & Privacy (1.00)
- Services (1.00)
- Law > Civil Rights & Constitutional Law (1.00)
- Law Enforcement & Public Safety (1.00)
- Leisure & Entertainment > Games
- Computer Games (0.67)
- Media (1.00)
- Education > Educational Setting
- Technology: