Permute, Quantize, and Fine-tune: Efficient Compression of Neural Networks

Open in new window