Pre-training Small Base LMs with Fewer Tokens

Open in new window