Little Giants: Synthesizing High-Quality Embedding Data at Scale