Sparse models and cheap SRAM for language models

Open in new window