Goto

Collaborating Authors

 mode collapse region



Reviews: Improved Precision and Recall Metric for Assessing Generative Models

Neural Information Processing Systems

This paper proposes a new metric for mode collapse, which is a scalar value that can be read off from previously proposed measure of mode collapse in PacGAN. Precisely, in the mode collapse region, one can read the two points: (i) where the mode collapse region touches vertical axis ( \delta -axis) and (ii) where the mode collapse r region touches \delta 1 line. Each one is exactly the same as P_r(support{P_g}) and P_g(support{P_r}) that defend the proposed scalar valued mode collapse measure. This should be explained precisely in the paper, as (i) PacGAN introduced a proper mathematical notion of mode collapse earlier, (ii) the mode collapse region strictly generalizes the proposed metric (iii) mode collapse regions is the foundation of understanding mode collapse theoretically. A new estimator based on nearest neighbor distances are proposed, with extensive numerical validation of the proposed metric.


Comparison of Generative Adversarial Networks Architectures Which Reduce Mode Collapse

arXiv.org Machine Learning

Generative Adversarial Networks are known for their high quality outputs and versatility. However, they also suffer the mode collapse in their output data distribution. There have been many efforts to revamp GANs model and reduce mode collapse. This paper focuses on two of these models, PacGAN and VEEGAN. This paper explains the mathematical theory behind aforementioned models, and compare their degree of mode collapse with vanilla GAN using MNIST digits as input data. The result indicates that PacGAN performs slightly better than vanilla GAN in terms of mode collapse, and VEEGAN performs worse than both PacGAN and vanilla GAN. VEEGAN's poor performance may be attributed to average autoencoder loss in its objective function and small penalty for blurry features.


PacGAN: The power of two samples in generative adversarial networks

Neural Information Processing Systems

Generative adversarial networks (GANs) are a technique for learning generative models of complex data distributions from samples. Despite remarkable advances in generating realistic images, a major shortcoming of GANs is the fact that they tend to produce samples with little diversity, even when trained on diverse datasets. This phenomenon, known as mode collapse, has been the focus of much recent work. We study a principled approach to handling mode collapse, which we call packing. The main idea is to modify the discriminator to make decisions based on multiple samples from the same class, either real or artificially generated. We draw analysis tools from binary hypothesis testing--in particular the seminal result of Blackwell [4]--to prove a fundamental connection between packing and mode collapse. We show that packing naturally penalizes generators with mode collapse, thereby favoring generator distributions with less mode collapse during the training process. Numerical experiments on benchmark datasets suggest that packing provides significant improvements.


PacGAN: The power of two samples in generative adversarial networks

Neural Information Processing Systems

Generative adversarial networks (GANs) are a technique for learning generative models of complex data distributions from samples. Despite remarkable advances in generating realistic images, a major shortcoming of GANs is the fact that they tend to produce samples with little diversity, even when trained on diverse datasets. This phenomenon, known as mode collapse, has been the focus of much recent work. We study a principled approach to handling mode collapse, which we call packing. The main idea is to modify the discriminator to make decisions based on multiple samples from the same class, either real or artificially generated. We draw analysis tools from binary hypothesis testing--in particular the seminal result of Blackwell [4]--to prove a fundamental connection between packing and mode collapse. We show that packing naturally penalizes generators with mode collapse, thereby favoring generator distributions with less mode collapse during the training process. Numerical experiments on benchmark datasets suggest that packing provides significant improvements.


PacGAN: The power of two samples in generative adversarial networks

arXiv.org Machine Learning

Generative adversarial networks (GANs) are innovative techniques for learning generative models of complex data distributions from samples. Despite remarkable recent improvements in generating realistic images, one of their major shortcomings is the fact that in practice, they tend to produce samples with little diversity, even when trained on diverse datasets. This phenomenon, known as mode collapse, has been the main focus of several recent advances in GANs. Yet there is little understanding of why mode collapse happens and why existing approaches are able to mitigate mode collapse. We propose a principled approach to handling mode collapse, which we call packing. The main idea is to modify the discriminator to make decisions based on multiple samples from the same class, either real or artificially generated. We borrow analysis tools from binary hypothesis testing---in particular the seminal result of Blackwell [Bla53]---to prove a fundamental connection between packing and mode collapse. We show that packing naturally penalizes generators with mode collapse, thereby favoring generator distributions with less mode collapse during the training process. Numerical experiments on benchmark datasets suggests that packing provides significant improvements in practice as well.