Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Neural Information Processing Systems 

By the second iteration, it exceeds the performance of RLHF-based methods across all metrics, achieving these results with less data.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found