Dense Unsupervised Learning for Video Segmentation

Jan-19-2025, 07:41:49 GMT–Neural Information Processing Systems

We present a novel approach to unsupervised learning for video object segmentation (VOS). Unlike previous work, our formulation allows dense feature representations to be learned directly in a fully convolutional regime. We rely on uniform grid sampling to extract a set of anchors and train our model to disambiguate between them on both inter- and intra-video levels. However, training such a model naively results in a degenerate solution. We propose to prevent this with a simple regularisation scheme that accommodates the equivariance of the segmentation task to similarity transformations.
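To make the anchor idea concrete, here is a minimal sketch of uniform grid sampling on a feature map, followed by a soft assignment of one feature vector to the sampled anchors. It is not the authors' implementation. The function names (`uniform_grid_anchors`, `anchor_posteriors`), the choice of cell centres as anchor positions, the cosine similarity, and the `temperature` value are all assumptions made for illustration; the abstract does not specify any of them.

```python
import math


def uniform_grid_anchors(height, width, grid):
    """Return (row, col) cell centres of a grid x grid partition of the map.

    Assumption: one anchor per cell, placed at the cell centre.
    """
    if grid <= 0 or grid > min(height, width):
        raise ValueError("grid must be in [1, min(height, width)]")
    rows = [int((i + 0.5) * height / grid) for i in range(grid)]
    cols = [int((j + 0.5) * width / grid) for j in range(grid)]
    return [(r, c) for r in rows for c in cols]


def anchor_posteriors(feature, anchors, temperature=0.1):
    """Softmax over cosine similarities between a feature and each anchor.

    Assumption: cosine similarity with a temperature stands in for whatever
    affinity the model actually uses to tell anchors apart.
    """
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(y * y for y in b))
        return dot / (norm_a * norm_b + 1e-12)

    logits = [cosine(feature, a) / temperature for a in anchors]
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]
```

A training objective would push each feature towards its own anchor and away from anchors sampled elsewhere in the same video and in other videos; the regulariser mentioned in the abstract is not modelled here.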

dense unsupervised learning, video segmentation

Technology:
- Information Technology > Artificial Intelligence
  - Vision (0.67)
  - Machine Learning > Unsupervised or Indirectly Supervised Learning (0.67)