BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages