Self-Play Fine-tuning of Diffusion Models for Text-to-image Generation

Open in new window