The Dire Defect of 'Multilingual' AI Content Moderation
This is part of the data recipe for Facebook's new large language model, which the company claims is able to detect and rein in harmful content in over 100 languages. Bumble uses similar technology to detect rude and unwanted messages in at least 15 languages. Google uses it for everything from translation to filtering newspaper comment sections. All have comparable recipes and the same dominant ingredient: English-language data. For years, social media companies have focused their automatic content detection and removal efforts more on content in English than on content in the world's 7,000 other languages.
May-23-2023, 12:00:00 GMT
- Country:
- South America > Brazil (0.06)
- Africa > Ethiopia (0.06)
- Asia > Philippines (0.06)
- Asia > Myanmar (0.06)
- Industry:
- Media (1.00)
- Information Technology > Services (0.96)
- Technology: