CLIP-Count: Towards Text-Guided Zero-Shot Object Counting

Open in new window