Yahoo releases 13.5TB Webscope data set for machine learning researchers
Yahoo is today announcing the release of a large-scale data set that describes people's usage of news feeds on several of the company's web services, including Yahoo News and Yahoo Finance. The idea is to empower machine learning researchers in academia with very rich data. The release of data is not, in and of itself, new for Yahoo -- there have been 56 previous releases in the Yahoo Labs Webscope program, which encompasses advertising, image, social, and ratings data, among other categories. This data set in particular covers 20 million people over the course of four months in 2015, and shows the types of devices people used to visit pages, how far down they got in the articles, and the top subjects of articles. There is data on people's locations, their ages (in some cases), and their gender -- all in an anonymized way. What's interesting about today's release is the size of the data set: 13.5TB.
Mar-25-2016, 09:55:54 GMT
- Country:
- North America > United States > California
- San Francisco County > San Francisco (0.06)
- San Diego County > San Diego (0.06)
- North America > United States > California
- Industry:
- Media (0.38)
- Technology: