Accelerated Training for Massive Classification via Dynamic Class Selection
Zhang, Xingcheng (The Chinese University of Hong Kong) | Yang, Lei (The Chinese University of Hong Kong) | Yan, Junjie (SenseTime Group Limited) | Lin, Dahua (The Chinese University of Hong Kong)
Massive classification, a classification task defined over a vast number of classes (hundreds of thousands or even millions), has become an essential part of many real-world systems, such as face recognition. Existing methods, including the deep networks that achieved remarkable success in recent years, were mostly devised for problems with a moderate number of classes. They would meet with substantial difficulties, e.g., excessive memory demand and computational cost, when applied to massive problems. We present a new method to tackle this problem. This method can efficiently and accurately identify a small number of "active classes" for each mini-batch, based on a set of dynamic class hierarchies constructed on the fly. We also develop an adaptive allocation scheme thereon, which leads to a better tradeoff between performance and cost. On several large-scale benchmarks, our method significantly reduces the training cost and memory demand, while maintaining competitive performance.
Feb-8-2018
- Genre:
- Research Report > New Finding (0.93)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Neural Networks (1.00)
- Performance Analysis > Accuracy (0.46)
- Statistical Learning (0.75)
- Natural Language (1.00)
- Vision > Face Recognition (0.67)
- Machine Learning
- Information Technology > Artificial Intelligence