Primer: Searching for Efficient Transformers for Language Modeling

Open in new window