Spatiotemporal Residual Networks for Video Action Recognition

Mar-17-2026, 07:55:36 GMT–Neural Information Processing Systems

Two-stream Convolutional Networks (ConvNets) have shown strong performance for human action recognition in videos. Recently, Residual Networks (ResNets) have arisen as a new technique to train extremely deep architectures. In this paper, we introduce spatiotemporal ResNets as a combination of these two approaches.

artificial intelligence, machine learning, proceedings, (9 more...)

Neural Information Processing Systems

Mar-17-2026, 07:55:36 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)