Less Memory Means Smaller GPUs: Backpropagation with Compressed Activations