Cut Your Losses in Large-Vocabulary Language Models
