TACO: Training-free Sound Prompted Segmentation via Deep Audio-visual CO-factorization