SuperCM: Revisiting Clustering for Semi-Supervised Learning

Singh, Durgesh, Boubekki, Ahcene, Jenssen, Robert, Kampffmeyer, Michael C.

arXiv.org Artificial Intelligence 

ABSTRACT The development of semi-supervised learning (SSL) has in recent years largely focused on the development of new consistency regularization or entropy minimization approaches, often resulting in models with complex training strategies to obtain the desired results. In this work, we instead propose a novel approach that explicitly incorporates the underlying clustering assumption in SSL through extending a recently proposed differentiable clustering module. Leveraging annotated data to guide the cluster centroids results in a simple end-to-end trainable deep SSL approach. We demonstrate that the proposed model improves the performance over the supervised-only baseline and show that our framework can be used in conjunction with other SSL methods to further boost their performance.

Index Terms -- Clustering, Semi-supervised learning, Gaussian mixture models

1. INTRODUCTION

Traditional deep learning has achieved state-of-the-art performance on various tasks at the cost of large-scale supervised training data.
