End-to-End Semantic Video Transformer for Zero-Shot Action Recognition

Open in new window