Detection-Oriented Image-Text Pretraining for Open-Vocabulary Detection
