Shrinking the Giant : Quasi-Weightless Transformers for Low Energy Inference