Uncertainty in Extreme Multi-label Classification

Jiang, Jyun-Yu, Chang, Wei-Cheng, Zhong, Jiong, Hsieh, Cho-Jui, Yu, Hsiang-Fu

Oct-18-2022–arXiv.org Artificial Intelligence

Extreme multi-label classification (XMC), or extreme multi-label learning, aims to find the relevant labels for a data input from an enormous label space. With increasingly growing information in the era of big data, XMC has become more and more important, and has been widely applied to various real-world applications, such as advertising [37], product search [9], and document retrieval [6]. However, for domains with potential high risks from mistakes like public health and medicine, it is crucial to model the predictive uncertainty for their downstream XMC applications like food classification [54] and medical diagnosis [2]. In particular, an input sometimes could have only few or even no matches in the label space, so the outputs could be noisy without uncertainty quantification. It is also insufficient to only model uncertainty for the entire input since XMC models could have different confidence for each label among the whole enormous space. To estimate predictive uncertainty, Bayesian and probabilistic models [20] are inherently applicable because variance can intrinsically be viewed as an uncertainty measurement. However, although Bayesian approaches are mathematically grounded to model uncertainty, their computational costs are usually exorbitant for large-scale data. To address this issue, the most popular solution is to approximate Bayesian inference by sampling models as an ensemble [17].

artificial intelligence, data mining, machine learning, (15 more...)

arXiv.org Artificial Intelligence

Oct-18-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California
    - Los Angeles County > Los Angeles (0.28)
    - Santa Clara County > Palo Alto (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (0.64)

Industry:
- Health & Medicine > Diagnostic Medicine (0.34)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Uncertainty
      - Bayesian Inference (0.68)
    - Machine Learning
      - Ensemble Learning (0.93)
      - Neural Networks > Deep Learning (0.47)
      - Performance Analysis > Accuracy (0.46)
      - Learning Graphical Models > Directed Networks
        Bayesian Learning (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found