Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation
Eiras, Francisco, Oksuz, Kemal, Bibi, Adel, Torr, Philip H. S., Dokania, Puneet K.
–arXiv.org Artificial Intelligence
Referring Image Segmentation (RIS) - the problem of identifying objects in images through natural language sentences - is a challenging task currently mostly solved through supervised learning. However, while collecting referred annotation masks is a time-consuming process, the few existing weakly-supervised and zero-shot approaches fall significantly short in performance compared to fully-supervised learning ones. To bridge the performance gap without mask annotations, we propose a novel weakly-supervised framework that tackles RIS by decomposing it into three steps: obtaining instance masks for the object mentioned in the referencing instruction (segment), using zero-shot learning to select a potentially correct mask for the given instruction (select), and bootstrapping a model which allows for fixing the mistakes of zero-shot selection (correct). In our experiments, using only the first two steps (zero-shot segment and select) outperforms other zero-shot baselines by as much as 19%, while our full method improves upon this much stronger baseline and sets the new state-of-the-art for weakly-supervised RIS, reducing the gap between the weakly-supervised and fully-supervised methods in some cases from around 33% to as little as 14%. Identifying particular object instances in images using natural language expressions - defined in the literature as referring image segmentation (RIS) (Wang et al., 2022b; Yang et al., 2022; Wu et al., 2022; Yu et al., 2023) - is an important problem that has many real-world applications including autonomous driving, general human-robot interactions (Wang et al., 2019) or natural language-driven image editing (Chen et al., 2018) to name a few.
arXiv.org Artificial Intelligence
Oct-23-2023
- Country:
- Europe > Switzerland > Zürich > Zürich (0.14)
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Information Technology (0.34)
- Technology: