Distilling Diffusion Models into Conditional GANs
Minguk Kang, Richard Zhang, Connelly Barnes, Sylvain Paris, Suha Kwak, Jaesik Park, Eli Shechtman, Jun-Yan Zhu, Taesung Park
arXiv.org Artificial Intelligence
We propose a method to distill a complex multistep diffusion model into a single-step conditional GAN student model, dramatically accelerating inference while preserving image quality. Our approach interprets diffusion distillation as a paired image-to-image translation task, using noise-to-image pairs from the diffusion model's ODE trajectory. For efficient regression loss computation, we propose E-LatentLPIPS, a perceptual loss that operates directly in the diffusion model's latent space and utilizes an ensemble of augmentations. Furthermore, we adapt a diffusion model to construct a multi-scale discriminator with a text alignment loss, yielding an effective conditional GAN-based formulation. Training with E-LatentLPIPS converges more efficiently than many existing distillation methods, even accounting for dataset construction costs. We demonstrate that our one-step generator outperforms cutting-edge one-step diffusion distillation models -- DMD, SDXL-Turbo, and SDXL-Lightning -- on the zero-shot COCO benchmark.
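As a rough illustration of the E-LatentLPIPS idea described above, the sketch below computes an ensembled perceptual distance directly on diffusion latents: for each of several rounds, it samples one random differentiable augmentation, applies the same transform to both the student's output latent and the regression target latent, and averages the resulting distances. This is a minimal sketch under stated assumptions, not the paper's implementation; the function names (`e_latent_lpips`, `sample_augment`) and the stand-in metric are hypothetical, whereas the actual method uses an LPIPS-style network trained in the latent space.

```python
import torch

def e_latent_lpips(latent_lpips, sample_augment, pred, target, n_aug=4):
    """Ensembled latent-space perceptual loss (hypothetical sketch).

    latent_lpips(a, b) -> per-pair distance on diffusion latents
                          (stand-in for a trained LPIPS-style metric).
    sample_augment()   -> one random differentiable transform; the *same*
                          transform is applied to both latents so the
                          distance between them stays meaningful.
    """
    loss = pred.new_zeros(())
    for _ in range(n_aug):
        t = sample_augment()  # one shared random augmentation per round
        loss = loss + latent_lpips(t(pred), t(target)).mean()
    return loss / n_aug

if __name__ == "__main__":
    # Toy usage with placeholder components (illustration only).
    pred = torch.randn(2, 4, 64, 64)    # latent-diffusion-style 4-channel latents
    target = torch.randn(2, 4, 64, 64)  # paired ODE-trajectory target latents

    def fake_lpips(a, b):
        # Stand-in metric: plain squared error per pair.
        return ((a - b) ** 2).flatten(1).mean(1)

    def sample_flip():
        # Stand-in augmentation pool: random horizontal flip or identity.
        if torch.rand(()) < 0.5:
            return lambda x: torch.flip(x, dims=[-1])
        return lambda x: x

    print(e_latent_lpips(fake_lpips, sample_flip, pred, target))
```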
Jun-13-2024