Efficiently Training A Flat Neural Network Before It has been Quantizated