Uncertainty in Extreme Multi-label Classification
Jiang, Jyun-Yu, Chang, Wei-Cheng, Zhong, Jiong, Hsieh, Cho-Jui, Yu, Hsiang-Fu
–arXiv.org Artificial Intelligence
Extreme multi-label classification (XMC), or extreme multi-label learning, aims to find the relevant labels for a data input from an enormous label space. With increasingly growing information in the era of big data, XMC has become more and more important, and has been widely applied to various real-world applications, such as advertising [37], product search [9], and document retrieval [6]. However, for domains with potential high risks from mistakes like public health and medicine, it is crucial to model the predictive uncertainty for their downstream XMC applications like food classification [54] and medical diagnosis [2]. In particular, an input sometimes could have only few or even no matches in the label space, so the outputs could be noisy without uncertainty quantification. It is also insufficient to only model uncertainty for the entire input since XMC models could have different confidence for each label among the whole enormous space. To estimate predictive uncertainty, Bayesian and probabilistic models [20] are inherently applicable because variance can intrinsically be viewed as an uncertainty measurement. However, although Bayesian approaches are mathematically grounded to model uncertainty, their computational costs are usually exorbitant for large-scale data. To address this issue, the most popular solution is to approximate Bayesian inference by sampling models as an ensemble [17].
arXiv.org Artificial Intelligence
Oct-18-2022
- Country:
- North America > United States
- California
- Los Angeles County > Los Angeles (0.28)
- Santa Clara County > Palo Alto (0.04)
- California
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- Genre:
- Research Report (0.64)
- Industry:
- Health & Medicine > Diagnostic Medicine (0.34)
- Technology: