EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models

Mar-22-2026, 16:59:40 GMT–Neural Information Processing Systems

Recent advancements in generation models have showcased remarkable capabilities in generating fantastic content. However, most of them are trained on proprietary high-quality data, and some models withhold their parameters and only provide accessible application programming interfaces (APIs), limiting their benefits for downstream tasks. To explore the feasibility of training a text-to-image generation model comparable to advanced models using publicly available resources, we introduce EvolveDirector. This framework interacts with advanced models through their public APIs to obtain text-image data pairs to train a base model. Our experiments with extensive data indicate that the model trained on generated data of the advanced model can approximate its generation capability.

artificial intelligence, machine learning, proceedings, (8 more...)

Neural Information Processing Systems

Mar-22-2026, 16:59:40 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (0.61)
  - Artificial Intelligence
    - Vision (0.83)
    - Machine Learning (0.58)