SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation Zhuoyan Luo

Neural Information Processing Systems 

This paper studies referring video object segmentation (RVOS) by boosting video-level visual-linguistic alignment. Recent approaches model the RVOS task as a sequence prediction problem and perform multi-modal interaction as well as segmentation for each frame separately.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found