SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
Zhuoyan Luo
Neural Information Processing Systems
This paper studies referring video object segmentation (RVOS) by boosting video-level visual-linguistic alignment. Recent approaches model the RVOS task as a sequence prediction problem and perform multi-modal interaction as well as segmentation for each frame separately.
Oct-8-2025, 17:02:59 GMT
- Country:
  - Asia > China > Guangdong Province > Shenzhen (0.04)
  - Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Genre:
  - Research Report (0.66)
- Technology:
  - Information Technology > Artificial Intelligence > Vision (1.00)