CoupAlign: Coupling Word-Pixel with Sentence-Mask Alignments for Referring Image Segmentation
Neural Information Processing Systems
Taking an image and a natural language sentence as input, a referring image segmentation (RIS) model is required to predict a mask for the object described by the sentence.
Aug-15-2025, 05:11:20 GMT
- Country:
- Asia > China
- Guangdong Province (0.04)
- Shaanxi Province > Xi'an (0.04)
- Genre:
- Research Report (0.68)
- Technology: