Probabilistic Label Trees for Extreme Multi-label Classification
Jasinska-Kobus, Kalina, Wydmuch, Marek, Dembczynski, Krzysztof, Kuznetsov, Mikhail, Busa-Fekete, Robert
Extreme multi-label classification (XMLC) is a learning task of tagging instances with a small subset of relevant labels chosen from an extremely large pool of possible labels. Problems of this scale can be efficiently handled by organizing labels as a tree, like in hierarchical softmax used for multi-class problems. In this paper, we thoroughly investigate probabilistic label trees (PLTs) which can be treated as a generalization of hierarchical softmax for multi-label problems. We first introduce the PLT model and discuss training and inference procedures and their computational costs. Next, we prove the consistency of PLTs for a wide spectrum of performance metrics. To this end, we upperbound their regret by a function of surrogate-loss regrets of node classifiers. Furthermore, we consider a problem of training PLTs in a fully online setting, without any prior knowledge of training instances, their features, or labels. In this case, both node classifiers and the tree structure are trained online. We prove a specific equivalence between the fully online algorithm and an algorithm with a tree structure given in advance. Finally, we discuss several implementations of PLTs and introduce a new one, napkinXC, which we empirically evaluate and compare with state-of-the-art algorithms.
Sep-23-2020
- Country:
- North America > United States
- New Jersey (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- Virginia > Arlington County
- Arlington (0.04)
- New York > New York County
- New York City (0.14)
- Georgia > Fulton County
- Atlanta (0.04)
- Europe
- Austria > Vienna (0.14)
- Italy > Sardinia (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Poland > Greater Poland Province
- Poznań (0.04)
- Asia
- North America > United States
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (0.67)
- Research Report