Findings of the Shared Task on Offensive Span Identification from Code-Mixed Tamil-English Comments

Ravikiran, Manikandan; Chakravarthi, Bharathi Raja; Madasamy, Anand Kumar; Sivanesan, Sangeetha; Rajalakshmi, Ratnavel; Thavareesan, Sajeetha; Ponnusamy, Rahul; Mahadevan, Shankar

arXiv.org Artificial Intelligence 

(Sivanantham and Seran, 2019). It is widely spoken in the southern state of Tamil Nadu in India, Sri Lanka, Malaysia, and Singapore. Tamil is an official language of Tamil Nadu, Sri Lanka, Singapore, and the Union Territory of Puducherry in India. A significant minority speaks Tamil in the four other South Indian states of Kerala, Karnataka, Andhra Pradesh, and Telangana, as well as the Union Territory of the Andaman and Nicobar Islands (Sakuntharaj and Mahesan, 2021, 2017, 2016; Thavareesan and Mahesan, 2019, 2020a,b, 2021).

Combating offensive content is crucial for the different entities involved in content moderation, which include social media companies as well as individuals (Kumaresan et al., 2021; Chakravarthi and Muralidaran, 2021). To this end, moderation is often restrictive, relying on human content moderators who are expected to read through the content and flag the offensive mentions (Arsht and Etcovitch, 2018). Alternatively, there are
