Zyda: A 1.3T Dataset for Open Language Modeling