Improving Deep Generative Models on Many-To-One Image-to-Image Translation

Feb-22-2024–arXiv.org Artificial Intelligence

Deep generative models have been applied to many image-to-image translation tasks. Generative Adversarial Networks and Diffusion Models have produced impressive results, setting a new state of the art on these tasks. Most methods use a symmetric setup across the domains in a dataset: they assume that every domain has either multiple modalities or only one. However, many datasets have a many-to-one relationship between two domains. In this work, we first introduce a Colorized MNIST dataset and a Color-Recall score that together provide a simple benchmark for evaluating models on many-to-one translation. We then introduce a new asymmetric framework to improve existing deep generative models on many-to-one image-to-image translation. We apply this framework to StarGAN V2 and show that the resulting model improves performance on many-to-one image-to-image translation in both unsupervised and semi-supervised settings.

dataset, image-to-image translation, translation, (14 more...)

Country:
- North America > United States
  - Maryland > Prince George's County
    - College Park (0.14)
  - California > San Francisco County
    - San Francisco (0.04)
- Europe > Germany
  - Saarland > Saarbrücken (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.81)
