Zyda: A 1.3T Dataset for Open Language Modeling
