Model compression as constrained optimization, with application to neural nets. Part II: quantization
