Accelerating CNN Training by Sparsifying Activation Gradients

Open in new window