AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation
–Neural Information Processing Systems
Open-vocabulary semantic segmentation is a challenging task that requires segmenting novel object categories at inference time. Recent works explore vision-language pre-training to handle this task, but they rely on assumptions that break down in practical scenarios, where the textual category names are often of low quality. For example, this paradigm assumes that new textual categories will be accurately and completely provided and that they exist in the lexicons used during pre-training. However, exceptions often arise: brief or incomplete names are ambiguous, new words are not present in the pre-trained lexicons, and some categories are difficult for users to describe. To address these issues, this work proposes a novel attribute decomposition-aggregation framework, AttrSeg, inspired by human cognition in understanding new concepts.
Jun-2-2025, 11:41:55 GMT
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Leisure & Entertainment (0.46)
- Media > Film (0.46)
- Technology: