Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation 3 Boyang Li4

May-25-2025, 15:18:08 GMT–Neural Information Processing Systems

This paper studies the problem of weakly open-vocabulary semantic segmentation (WOVSS), which learns to segment objects of arbitrary classes using mere image-text pairs. Existing works turn to enhance the vanilla vision transformer by introducing explicit grouping recognition, i.e., employing several group tokens/centroids to cluster the image tokens and perform the group-text alignment. Nevertheless, these methods suffer from a granularity inconsistency regarding the usage of group tokens, which are aligned in the all-to-one v.s.

artificial intelligence, machine learning, natural language, (13 more...)

Neural Information Processing Systems

May-25-2025, 15:18:08 GMT

Conferences PDF

Add feedback

Country:
- Asia
  - China (0.28)
  - Middle East > Israel (0.14)
- Europe > Switzerland
  - Zürich > Zürich (0.14)

Genre:
- Research Report (0.66)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning
      - Neural Networks > Deep Learning (0.46)
      - Statistical Learning (0.68)
    - Natural Language (1.00)
    - Vision (1.00)
  - Sensing and Signal Processing > Image Processing (1.00)

Duplicate Docs Excel Report

Title
Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation 3 Boyang Li

Similar Docs Excel Report more

Title	Similarity	Source
None found