Detecting Harmful Content On Online Platforms: What Platforms Need Vs. Where Research Efforts Go
Arora, Arnav, Nakov, Preslav, Hardalov, Momchil, Sarwar, Sheikh Muhammad, Nayak, Vibha, Dinkov, Yoan, Zlatkova, Dimitrina, Dent, Kyle, Bhatawdekar, Ameya, Bouchard, Guillaume, Augenstein, Isabelle
–arXiv.org Artificial Intelligence
The proliferation of harmful content on online platforms is a major societal problem, which comes in many different forms including hate speech, offensive language, bullying and harassment, misinformation, spam, violence, graphic content, sexual abuse, self harm, and many other. Online platforms seek to moderate such content to limit societal harm, to comply with legislation, and to create a more inclusive environment for their users. Researchers have developed different methods for automatically detecting harmful content, often focusing on specific sub-problems or on narrow communities, as what is considered harmful often depends on the platform and on the context. We argue that there is currently a dichotomy between what types of harmful content online platforms seek to curb, and what research efforts there are to automatically detect such content. We thus survey existing methods as well as content moderation policies by online platforms in this light and we suggest directions for future work.
arXiv.org Artificial Intelligence
Jun-6-2023
- Country:
- Asia > Middle East
- UAE (0.28)
- Europe (1.00)
- North America > United States
- California (0.28)
- Massachusetts (0.28)
- Minnesota (0.28)
- Washington > King County
- Seattle (0.28)
- Asia > Middle East
- Genre:
- Research Report (1.00)
- Industry:
- Government (1.00)
- Health & Medicine (1.00)
- Information Technology
- Security & Privacy (1.00)
- Services (1.00)
- Law > Civil Rights & Constitutional Law (0.93)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Media > News (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (0.93)
- Natural Language (1.00)
- Machine Learning > Neural Networks
- Communications > Social Media (1.00)
- Data Science (1.00)
- Security & Privacy (1.00)
- Artificial Intelligence
- Information Technology