Block-wise Bit-Compression of Transformer-based Models
