The GAN is dead; long live the GAN! A Modern GAN Baseline
Huang, Yiwen, Gokaslan, Aaron, Kuleshov, Volodymyr, Tompkin, James
–arXiv.org Artificial Intelligence
There is a widely-spread claim that GANs are difficult to train, and GAN architectures in the literature are littered with empirical tricks. We provide evidence against this claim and build a modern GAN baseline in a more principled manner. First, we derive a well-behaved regularized relativistic GAN loss that addresses issues of mode dropping and non-convergence that were previously tackled via a bag of ad-hoc tricks. We analyze our loss mathematically and prove that it admits local convergence guarantees, unlike most existing relativistic losses. Second, this loss allows us to discard all ad-hoc tricks and replace outdated backbones used in common GANs with modern architectures. Using StyleGAN2 as an example, we present a roadmap of simplification and modernization that results in a new minimalist baseline--R3GAN ("Re-GAN"). Despite being simple, our approach surpasses StyleGAN2 on FFHQ, ImageNet, CIFAR, and Stacked MNIST datasets, and compares favorably against state-of-the-art GANs and diffusion models.
arXiv.org Artificial Intelligence
Jan-9-2025
- Country:
- Europe
- Germany (0.14)
- Netherlands (0.14)
- Europe
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Information Technology (0.67)
- Technology: