High-dimensional Analysis of Synthetic Data Selection