The GAN is dead; long live the GAN! A Modern GAN Baseline

Huang, Yiwen, Gokaslan, Aaron, Kuleshov, Volodymyr, Tompkin, James

Jan-9-2025–arXiv.org Artificial Intelligence

There is a widely-spread claim that GANs are difficult to train, and GAN architectures in the literature are littered with empirical tricks. We provide evidence against this claim and build a modern GAN baseline in a more principled manner. First, we derive a well-behaved regularized relativistic GAN loss that addresses issues of mode dropping and non-convergence that were previously tackled via a bag of ad-hoc tricks. We analyze our loss mathematically and prove that it admits local convergence guarantees, unlike most existing relativistic losses. Second, this loss allows us to discard all ad-hoc tricks and replace outdated backbones used in common GANs with modern architectures. Using StyleGAN2 as an example, we present a roadmap of simplification and modernization that results in a new minimalist baseline--R3GAN ("Re-GAN"). Despite being simple, our approach surpasses StyleGAN2 on FFHQ, ImageNet, CIFAR, and Stacked MNIST datasets, and compares favorably against state-of-the-art GANs and diffusion models.

artificial intelligence, arxiv preprint arxiv, machine learning, (11 more...)

arXiv.org Artificial Intelligence

Jan-9-2025

arXiv.org PDF

Add feedback

Country:
- Europe
  - Germany (0.14)
  - Netherlands (0.14)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Information Technology (0.67)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)
    - Representation & Reasoning (0.92)
    - Vision (1.00)
  - Sensing and Signal Processing > Image Processing (1.00)