Alchemist: Turning Public Text-to-Image Data into Generative Gold
–Neural Information Processing Systems
Pre-training equips text-to-image (T2I) models with broad world knowledge, but this alone is often insufficient to achieve high aesthetic quality and alignment. Consequently, supervised fine-tuning (SFT) is crucial for further refinement. However, its effectiveness highly depends on the quality of the fine-tuning dataset. Existing public SFT datasets frequently target narrow domains (e.g., anime or specific art styles), and the creation of high-quality, general-purpose SFT datasets remains a significant challenge. Current curation methods are often costly and struggle to identify truly impactful samples.
Neural Information Processing Systems
Jun-15-2026, 13:14:56 GMT
- Country:
- Asia (0.28)
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Research Report
- Industry:
- Information Technology (0.67)
- Transportation (0.46)
- Leisure & Entertainment > Sports (0.46)
- Technology: