EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models
Neural Information Processing Systems
Recent advancements in generation models have showcased remarkable capabilities in generating fantastic content. However, most of these models are trained on proprietary high-quality data, and some withhold their parameters and expose only application programming interfaces (APIs), which limits their usefulness for downstream tasks. To explore the feasibility of training a text-to-image generation model comparable to advanced models using publicly available resources, we introduce EvolveDirector. This framework interacts with advanced models through their public APIs to obtain text-image data pairs, which are then used to train a base model. Our experiments with extensive data indicate that a model trained on data generated by the advanced model can approximate its generation capability.
Mar-27-2025, 11:28:43 GMT
- Country:
- Europe > Germany (0.14)
- North America > United States (0.14)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.68)
- Industry:
- Education > Educational Setting
- Online (0.46)
- Media > Photography (0.92)
- Technology: