AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation

Jun-2-2025, 11:41:55 GMT–Neural Information Processing Systems

Open-vocabulary semantic segmentation is a challenging task that requires segmenting novel object categories at inference time. Recent works explore vision-language pre-training to handle this task, but they rest on assumptions that are unrealistic in practical scenarios, where textual category names are often of low quality. For example, this paradigm assumes that new textual categories will be provided accurately and completely, and that they exist in the lexicons used during pre-training. However, exceptions often occur: brief or incomplete names are ambiguous, new words are absent from the pre-trained lexicons, and some categories are difficult for users to describe. To address these issues, this work proposes a novel attribute decomposition-aggregation framework, AttrSeg, inspired by human cognition in understanding new concepts.

large language model, machine learning, segmentation

Country:
- Asia > China (0.14)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Leisure & Entertainment (0.46)
- Media > Film (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (0.68)
  - Natural Language > Large Language Model (0.70)
  - Representation & Reasoning > Object-Oriented Architecture (0.48)
  - Vision > Image Understanding (0.46)