Shrinking the Giant : Quasi-Weightless Transformers for Low Energy Inference

Open in new window