Goto

Collaborating Authors

 abc



ABC: Auxiliary Balanced Classifier for Class-Imbalanced Semi-Supervised Learning

Neural Information Processing Systems

Existing semi-supervised learning (SSL) algorithms typically assume classbalanced datasets, although the class distributions of many real-world datasets are imbalanced. In general, classifiers trained on a class-imbalanced dataset are biased toward the majority classes. This issue becomes more problematic for SSL algorithms because they utilize the biased prediction of unlabeled data for training. However, traditional class-imbalanced learning techniques, which are designed for labeled data, cannot be readily combined with SSL algorithms. We propose a scalable class-imbalanced SSL algorithm that can effectively use unlabeled data, while mitigating class imbalance by introducing an auxiliary balanced classifier (ABC) of a single layer, which is attached to a representation layer of an existing SSL algorithm. The ABC is trained with a class-balanced loss of a minibatch, while using high-quality representations learned from all data points in the minibatch using the backbone SSL algorithm to avoid overfitting and information loss. Moreover, we use consistency regularization, a recent SSL technique for utilizing unlabeled data in a modified way, to train the ABC to be balanced among the classes by selecting unlabeled data with the same probability for each class. The proposed algorithm achieves state-of-the-art performance in various class-imbalanced SSL experiments using four benchmark datasets.



Appendix

Neural Information Processing Systems

We provide more information on AIPS' deductive engine and the training process for the value network. To highlight the reasoning ability and maintain readability of proofs, we avoid using brute-force methods such as augmentation-substitution and Wu's method Wu [1978].






Appendix AProofs A.1 ProofofLemma3.1

Neural Information Processing Systems

A.4 ProofofLemma4.5 The proof of Lemma 4.5 depends on another lemma, which will also be useful in the unknown hypothesis class setting. Then there exists somec for whichhc(x) / {0,1}, which is impossible. If qH(x) = c, we don't need to prove anything. Let h H denote the decision maker's decision rule. From Corollary 4.6, we know that h(x)=1 {qH(x) c}aslongasqH(x)6=c.