Zyda-2: a 5 Trillion Token High-Quality Dataset