Clustering As A Universal Visual Learner