What Does Artificial Intelligence See In A Quarter Billion Global News Photographs?
What would it look like to ask a deep learning AI system to watch every political television advertisement of the 2016 presidential campaign season for two months and describe what it sees? That was the question I asked last February when I collaborated with the Internet Archive to take all 267 political ads they had identified (which had aired a collective 72,807 times as monitored by the Archive) and ran them frame-by-frame through Google's Cloud Vision API, producing what is likely the first large-scale application of production deep learning algorithms to describe the visual narratives of political advertising on television. Now, what if we took this same approach and instead of examining television, we looked at a quarter billion news photographs compiled from online news outlets in nearly every country of the world over the course of 2016? What would AI see in that vast archive of the visual narratives of the world's media? Google's Cloud Vision API is a commercial cloud service that accepts as input any arbitrary photograph and uses deep learning algorithms to catalog a wealth of data about each image, including a list of objects and activities it depicts, recognizable logos, OCR text recognition in almost 80 languages, levels of violence, an estimate of visual sentiment and even the precise location on earth the image appears to depict.
Feb-26-2017, 01:55:03 GMT
- Country:
- South America (0.04)
- North America > Central America (0.04)
- Africa > Middle East (0.04)
- Europe
- Russia (0.04)
- Middle East (0.04)
- Asia
- Russia (0.04)
- Middle East (0.04)
- China (0.04)
- Technology: