LookupFFN: Making Transformers Compute-lite for CPU inference

Open in new window