Localized Vision-Language Matching for Open-vocabulary Object Detection

Open in new window