a7c4163b33286261b24c72fd3d1707c9-Supplemental-Datasets_and_Benchmarks.pdf
–Neural Information Processing Systems
These datasets enable large-scale study of abuse detection for these languages. Anonymized comments: To further address privacy concerns, we anonymize our dataset. We combine thehate and offensivecategories in these datasets for training a binary classification model. We showthepercentage (%)ofemoticons present inourdatasetMACDinTable12. Infuture work,we will investigate in detail about the impact of emoticons on abuse detection. However,duetothe limited scale and diversity of abuse detection datasets in Indic languages, development of these models for Indic languages has been severely impeded.
Neural Information Processing Systems
Feb-19-2026, 09:12:33 GMT
- Country:
- Asia > India > West Bengal > Kharagpur (0.04)
- Industry:
- Information Technology (0.34)
- Law (0.47)
- Technology: