Consistent Structural Relation Learning for Zero-Shot Segmentation

Oct-10-2024, 13:20:53 GMT–Neural Information Processing Systems

Zero-shot semantic segmentation aims to recognize the semantics of pixels from unseen categories with zero training samples. Previous practice [1] proposed to train the classifiers for unseen categories using the visual features generated from semantic word embeddings. However, the generator is merely learned on the seen categories while no constraint is applied to the unseen categories, leading to poor generalization ability. In this work, we propose a Consistent Structural Relation Learning (CSRL) approach to constrain the generating of unseen visual features by exploiting the structural relations between seen and unseen categories. We observe that different categories are usually with similar relations in either semantic word embedding space or visual feature space.

consistent structural relation learning, unseen category, visual feature, (5 more...)

Neural Information Processing Systems

Oct-10-2024, 13:20:53 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.64)
  - Machine Learning (0.41)