Facebook's 'Rosetta' AI can extract text from a billion images daily
People online communicate not just with words but also with images. For a platform like Facebook, with over 2 billion monthly active users, that means an enormous number of images, memes included, gets posted every day. To extract the text in them, Facebook has created and deployed a large-scale machine learning system called "Rosetta." The system lets Facebook include images containing text in relevant photo search results, gives screen readers a way to read what is written on them, and helps make sure they don't contain hate speech or other words that violate the website's content policy. Facebook needed an optical character recognition system that could regularly process huge volumes of content, so it had to build its own technology. In a new blog post, the company explained how Rosetta works: it starts by detecting rectangular regions in images that potentially contain text.
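The first step described above, finding rectangular regions that may contain text, can be illustrated with a toy sketch. This is not Rosetta's detector, whose internals are not given here. It assumes a hypothetical binary mask in which 1 marks a pixel already flagged as possible text, and it swaps in a plain connected-component search as a stand-in for a learned model. The function name `find_text_regions` and the mask format are illustrative assumptions.

```python
from collections import deque


def find_text_regions(mask):
    """Return one bounding box (top, left, bottom, right) per connected
    group of 1-pixels in a binary mask (a list of equal-length rows).

    Toy stand-in for a learned text detector: the mask itself is assumed
    to come from some earlier, unspecified step.
    """
    rows = len(mask)
    cols = len(mask[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []

    for r in range(rows):
        for c in range(cols):
            if mask[r][c] != 1 or seen[r][c]:
                continue

            # Breadth-first search over 4-connected neighbours,
            # growing the bounding box as new pixels are visited.
            top, left, bottom, right = r, c, r, c
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    ny, nx = y + dy, x + dx
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))

            boxes.append((top, left, bottom, right))

    return boxes


# Hypothetical 4x5 mask with two separate candidate regions.
example_mask = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 0, 0, 1],
    [0, 0, 0, 0, 1],
]
regions = find_text_regions(example_mask)
```

Each returned tuple is a rectangle that a later recognition step could crop and read. A production system would produce such rectangles with a trained model rather than a pixel search.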
Sep-11-2018, 16:55:13 GMT