A Survey on Extreme Multi-label Learning
Wei, Tong, Mao, Zhen, Shi, Jiang-Xin, Li, Yu-Feng, Zhang, Min-Ling
–arXiv.org Artificial Intelligence
Multi-label learning has attracted significant attention from both academic and industry field in recent decades. Although existing multi-label learning algorithms achieved good performance in various tasks, they implicitly assume the size of target label space is not huge, which can be restrictive for real-world scenarios. Moreover, it is infeasible to directly adapt them to extremely large label space because of the compute and memory overhead. Therefore, eXtreme Multi-label Learning (XML) is becoming an important task and many effective approaches are proposed. To fully understand XML, we conduct a survey study in this paper. We first clarify a formal definition for XML from the perspective of supervised learning. Then, based on different model architectures and challenges of the problem, we provide a thorough discussion of the advantages and disadvantages of each category of methods. For the benefit of conducting empirical studies, we collect abundant resources regarding XML, including code implementations, and useful tools. Lastly, we propose possible research directions in XML, such as new evaluation metrics, the tail label problem, and weakly supervised XML.
arXiv.org Artificial Intelligence
Oct-8-2022
- Country:
- North America > United States
- Indiana > Madison County > Anderson (0.04)
- Asia
- Singapore (0.04)
- China > Jiangsu Province
- Nanjing (0.04)
- North America > United States
- Genre:
- Overview (1.00)
- Industry:
- Government (0.46)
- Technology:
- Information Technology
- Information Management (1.00)
- Data Science > Data Mining (1.00)
- Artificial Intelligence
- Representation & Reasoning (1.00)
- Natural Language (1.00)
- Vision (0.93)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)
- Inductive Learning (0.88)
- Supervised Learning (0.68)
- Information Technology