More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation

Open in new window