Spatiotemporal Residual Networks for Video Action Recognition
–Neural Information Processing Systems
Two-stream Convolutional Networks (ConvNets) have shown strong performance for human action recognition in videos. Recently, Residual Networks (ResNets) have arisen as a new technique to train extremely deep architectures. In this paper, we introduce spatiotemporal ResNets as a combination of these two approaches.
Neural Information Processing Systems
Mar-17-2026, 07:55:36 GMT
- Technology: