Yahoo Labs released the largest-ever anonymized machine learning data set for researchers

In January 2016, Yahoo announced the public release of the largest-ever machine learning data set to the international research community. The data set comprises roughly 110B events (13.5TB uncompressed) of anonymized user-news item interaction data, recorded from about 20M users between February 2015 and May 2015.

See: https://yahoolabs.tumblr.com/post/137281912191/yahoo-releases-the-largest-ever-machine-learning