Directions in Abusive Language Training Data: Garbage In, Garbage Out
Vidgen, Bertie, Derczynski, Leon
–arXiv.org Artificial Intelligence
Data-driven analysis and detection of abusive online content covers many different tasks, phenomena, contexts, and methodologies. This paper systematically reviews abusive language dataset creation and content in conjunction with an open website for cataloguing abusive language data. This collection of knowledge leads to a synthesis providing evidence-based recommendations for practitioners working with this complex and highly diverse data.
arXiv.org Artificial Intelligence
Jul-19-2021
- Country:
- South America > Venezuela (0.04)
- Asia > India (0.04)
- North America
- United States
- Hawaii (0.04)
- New York > New York County
- New York City (0.04)
- New Jersey > Mercer County
- Princeton (0.04)
- Canada > British Columbia
- United States
- Europe
- France (0.04)
- Germany (0.04)
- Slovenia (0.04)
- Croatia (0.04)
- Bulgaria > Varna Province
- Varna (0.04)
- Italy > Tuscany
- Florence (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Spain > Andalusia
- Seville Province > Seville (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.14)
- Greater London > London (0.04)
- Genre:
- Overview (1.00)
- Research Report > New Finding (0.67)
- Industry:
- Media (1.00)
- Law Enforcement & Public Safety (1.00)
- Law > Civil Rights & Constitutional Law (1.00)
- Health & Medicine (1.00)
- Government (1.00)
- Leisure & Entertainment > Games
- Computer Games (0.67)
- Information Technology
- Services (1.00)
- Security & Privacy (1.00)
- Education > Educational Setting
- Higher Education (0.45)
- Technology: