Distribution Adaptive INT8 Quantization for Training CNNs