Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
