Online Multi-Label Classification: A Label Compression Method
Many modern applications deal with multi-label data, such as functional categorizations of genes, image labeling and text categorization. Classification of such data with a large number of labels and latent dependencies among them is a challenging task, and it becomes even more challenging when the data is received online and in chunks. Many of the current multi-label classification methods require a lot of time and memory, which make them infeasible for practical real-world applications. In this paper, we propose a fast linear label space dimension reduction method that transforms the labels into a reduced encoded space and trains models on the obtained pseudo labels. Additionally, it provides an analytical method to update the decoding matrix which maps the labels into the original space and is used during the test phase. Experimental results show the effectiveness of this approach in terms of running times and the prediction performance over different measures. Keywords: data stream classification, multi-label data, label compression 1. Introduction Standard classification is the task of assigning the correct class to previously unknown test instances based on training instances. Training data consist of a set of features and an associated target class or class label. Many modern data mining applications, however, need to deal with more than one label per instance.
Apr-4-2018
- Country:
- Oceania > New Zealand
- North Island > Waikato (0.04)
- Europe > Germany
- Rheinland-Pfalz > Mainz (0.04)
- Asia > Middle East
- Lebanon (0.04)
- Oceania > New Zealand
- Genre:
- Research Report > New Finding (0.48)
- Technology: