VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion

Jun-14-2026, 03:28:36 GMT–Neural Information Processing Systems

Current perception models have achieved remarkable success by leveraging large-scale labeled datasets, but they still struggle in open-world environments that contain novel objects. To address this limitation, researchers have introduced open-set perception models, which detect or segment arbitrary categories supplied by the user at test time. However, open-set models depend on a human to provide predefined object categories as input during inference. More recently, researchers have framed a more realistic and challenging task, open-ended perception, which aims to discover unseen objects without any category-level input from humans at inference time. Nevertheless, open-ended models still perform worse than open-set models.

artificial intelligence, machine learning, proceedings

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.56)