Deep Embedding for Determining the Number of Clusters
Wang, Yiqi (National University of Defense Technology) | Shi, Zhan (University of Texas at Austin) | Guo, Xifeng (National University of Defense Technology) | Liu, Xinwang (National University of Defense Technology) | Zhu, En (National University of Defense Technology) | Yin, Jianping (Dongguan University of Technology)
Determining the number of clusters is important but challenging, especially for data of high dimension. In this paper, we propose Deep Embedding Determination (DED), a method that can solve jointly for the unknown number of clusters and feature extraction. DED first combines the virtues of the convolutional autoencoder and the t-SNE technique to extract low dimensional embedded features. Then it determines the number of clusters using an improved density-based clustering algorithm. Our experimental evaluation on image datasets shows significant improvement over state-of-the-art methods and robustness with respect to hyperparameter settings.
Feb-8-2018