Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
Neural Information Processing Systems
Two popular forms of weak supervision used in open-vocabulary detection (OVD) are the pretrained CLIP model and image-level supervision.
Aug-19-2025, 09:13:50 GMT
- Genre:
- Research Report > New Finding (0.68)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Natural Language (1.00)
- Vision (1.00)