Most popular Kaggle competition solutions
Large Scale Hierarchical Text Classification is a document classification challenge: classify a given Wikipedia document into the 325,056 available categories. This very large dataset was built from Wikipedia and is multi-class, multi-label and hierarchical. It contains 325,056 categories and about 2,400,000 documents. The challenge builds upon a series of successful challenges on large-scale hierarchical text classification. More information on the dataset is available from Demokritos at http://lshtc.iit.demokritos.gr/
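To make "multi-label and hierarchical" concrete, here is a minimal Python sketch of how one document's labels might be expanded with their ancestor categories. The category names and the `PARENT` map are hypothetical toy values, not real identifiers from the challenge data, and the sketch assumes each category has a single parent, which is a simplification.

```python
# Hypothetical toy hierarchy: child category -> parent category.
# These names are illustrative only; they are not real LSHTC category IDs.
PARENT = {
    "neural_networks": "machine_learning",
    "machine_learning": "computer_science",
    "graph_theory": "mathematics",
}


def with_ancestors(labels, parent=PARENT):
    """Return the given labels plus every ancestor reachable through `parent`."""
    expanded = set()
    for label in labels:
        node = label
        # Walk up the hierarchy until the root or an already-visited node.
        while node is not None and node not in expanded:
            expanded.add(node)
            node = parent.get(node)
    return expanded


# Multi-label: one document carries several leaf categories at once.
doc_labels = ["neural_networks", "graph_theory"]
expanded_labels = with_ancestors(doc_labels)
print(sorted(expanded_labels))
```

Expanding labels this way is one common option for hierarchical problems, since it lets a classifier get partial credit at coarser levels. Nothing in the text above says that the competition solutions did this.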
Oct-8-2016, 14:56:27 GMT