Searching for Better Spatio-temporal Alignment in Few-Shot Action Recognition
Neural Information Processing Systems
Spatio-temporal feature matching and alignment are essential for few-shot action recognition, as they determine the coherence and effectiveness of the temporal patterns. Nevertheless, this process can be unreliable, especially in complex video scenarios. In this paper, we propose to improve matching and alignment through the end-to-end design of models. Our solution is twofold. First, we enhance the spatio-temporal representations extracted from few-shot videos from an architectural perspective.
Jan-17-2025, 03:33:35 GMT