Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers Yiwei Lu