Spatiotemporal Joint Filter Decomposition in 3D Convolutional Neural Networks Zichen Miao
–Neural Information Processing Systems
For example, as shown later, we can now achieve tempo-invariance by simply dilating temporal atoms only. To illustrate this useful atom-swapping property, we further demonstrate how such a decomposition permits the direct learning of 3D CNNs with full-size videos through iterations of two consecutive sub-stages of learning: In the temporal stage, full-temporal downsampled-spatial data are used to learn temporal atoms and joint coefficients while fixing spatial atoms.
Neural Information Processing Systems
Oct-2-2025, 16:33:26 GMT