MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training
Neural Information Processing Systems
By training only a query-based image instance segmentation model, MinVIS outperforms the previous best result on the challenging Occluded VIS dataset by over 10% AP.
Aug-18-2025, 22:30:19 GMT