Clustering via Self-Supervised Diffusion
Uziel, Roy, Chelly, Irit, Freifeld, Oren, Pakman, Ari
–arXiv.org Artificial Intelligence
Diffusion models, widely recognized for their success in generative tasks, have not yet been applied to clustering. We introduce Clustering via Diffusion (CLUDI), a self-supervised framework that combines the generative power of diffusion models with pre-trained Vision Transformer features to achieve robust and accurate clustering. CLUDI is trained via a teacher-student paradigm: the teacher uses stochastic diffusion-based sampling to produce diverse cluster assignments, which the student refines into stable predictions. This stochasticity acts as a novel data augmentation strategy, enabling CLUDI to uncover intricate structures in high-dimensional data. Extensive evaluations on challenging datasets demonstrate that CLUDI achieves state-of-the-art performance in unsupervised classification, setting new benchmarks in clustering robustness and adaptability to complex data distributions. Our code is available at https://github.com/BGU-CS-VIL/CLUDI.
arXiv.org Artificial Intelligence
Jul-31-2025
- Country:
- Asia > Middle East
- Israel > Southern District > Beer-Sheva (0.04)
- North America > Canada (0.04)
- Asia > Middle East
- Genre:
- Research Report (0.82)
- Industry:
- Education (0.93)
- Technology: