Jigsaw releases data set to help develop AI that detects toxic comments
Mitigating prejudicial and abusive behavior online is no easy feat, given the level of toxicity in some communities. More than one in five respondents in a recent survey reported being subjected to physical threats, and nearly one in five experienced sexual harassment, stalking, or sustained harassment. Of those who experienced harassment, upwards of 20% said it was the result of their gender identity, race, ethnicity, sexual orientation, religion, occupation, or disability. In pursuit of a solution, Jigsaw -- the organization working under Google parent company Alphabet to tackle cyber bullying, censorship, disinformation, and other digital issues of the day -- today released what it claims is the largest public data set of comments and annotations with toxicity labels and identity labels. It's intended to help measure bias in AI comment classification systems, which Jigsaw and others have historically measured using synthetic data from template sentences.
Nov-19-2019, 23:44:44 GMT
- Industry:
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.57)
- Law (0.57)
- Media > News (0.54)
- Information Technology > Security & Privacy (0.37)
- Health & Medicine > Therapeutic Area
- Psychiatry/Psychology (0.37)
- Technology: