Multi-Modal Classifiers for Open-Vocabulary Object Detection

Open in new window