DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut

Neural Information Processing Systems 

Foundation models have emerged as powerful tools across various domains including language, vision, and multimodal tasks.