Spatiotemporal Joint Filter Decomposition in 3D Convolutional Neural Networks

Apr-24-2026, 23:10:54 GMT–Neural Information Processing Systems

In this paper, we introduce spatiotemporal joint filter decomposition to decouple spatial and temporal learning, while preserving spatiotemporal dependency in a video. A 3D convolutional filter is now jointly decomposed over a set of spatial and temporal filter atoms respectively. In this way, a 3D convolutional layer becomes three: a temporal atom layer, a spatial atom layer, and a joint coefficient layer, all three remaining convolutional. One obvious arithmetic manipulation allowed in our joint decomposition is to swap spatial or temporal atoms with a set of atoms that have the same number but different sizes, while keeping the remaining unchanged. For example, as shown later, we can now achieve tempo-invariance by simply dilating temporal atoms only. To illustrate this useful atom-swapping property, we further demonstrate how such a decomposition permits the direct learning of 3DCNNs with full-size videos through iterations of two consecutive sub-stages of learning: In the temporal stage, full-temporal downsampled-spatial data are used to learn temporal atoms and joint coefficients while fixing spatial atoms. In the spatial stage, full-spatial downsampled-temporal data are used for spatial atoms and joint coefficients while fixing temporal atoms. We show empirically on multiple action recognition datasets that, the decoupled spatiotemporal learning significantly reduces the model memory footprints, and allows deep 3DCNNs to model high-spatial long-temporal dependency with limited computational resources while delivering comparable performance.

artificial intelligence, machine learning, video, (16 more...)

Neural Information Processing Systems

Apr-24-2026, 23:10:54 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.82)

Duplicate Docs Excel Report

Title
1bb91f73e9d31ea2830a5e73ce3ed328-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found