HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression
