Towards Lower Bit Multiplication for Convolutional Neural Network Training