GTQuery-based flowOp4cal flow
–Neural Information Processing Systems
Video Semantic Segmentation (VSS) involves assigning a semantic label to each pixel in a video sequence. Prior work in this field has demonstrated promising results by extending image semantic segmentation models to exploit temporal relationships across video frames; however, these approaches often incur significant computational costs. In this paper, we propose an efficient mask propagation framework for VSS, called MPVSS. Our approach first employs a strong querybased image segmentor on sparse key frames to generate accurate binary masks and class predictions. We then design a flow estimation module utilizing the learned queries to generate a set of segment-aware flow maps, each associated with a mask prediction from the key frame.
Neural Information Processing Systems
Apr-25-2026, 07:06:25 GMT
- Technology:
- Information Technology
- Graphics (0.76)
- Sensing and Signal Processing (0.68)
- Artificial Intelligence
- Vision (1.00)
- Representation & Reasoning (1.00)
- Natural Language (1.00)
- Machine Learning > Neural Networks
- Deep Learning (0.46)
- Information Technology