FlowFeat: Pixel-Dense Embedding of Motion Profiles
–Neural Information Processing Systems
Dense and versatile image representations underpin the success of virtually all computer vision applications. However, state-of-the-art networks, such as transformers, produce low-resolution feature grids, which are suboptimal for dense prediction tasks. To address this limitation, we present, a high-resolution and multi-task feature representation. The key ingredient behind FlowFeat is a novel distillation technique that embeds a distribution of plausible apparent motions, or . By leveraging optical flow networks and diverse video data, we develop an effective self-supervised training framework that statistically approximates the apparent motion.
Neural Information Processing Systems
Jun-14-2026, 07:02:43 GMT
- Technology:
- Information Technology > Artificial Intelligence > Vision (1.00)