SEED: Domain-Specific Data Curation With Large Language Models