Gromov-Wasserstein Autoencoders
Nakagawa, Nao, Togo, Ren, Ogawa, Takahiro, Haseyama, Miki
arXiv.org Artificial Intelligence
Variational autoencoder (VAE)-based generative models offer flexible representation learning by incorporating meta-priors, general premises considered beneficial for downstream tasks. However, incorporating meta-priors often requires ad hoc modifications to the original likelihood-based architecture, which cause undesirable changes in training. In this paper, we propose a novel representation learning method, Gromov-Wasserstein Autoencoders (GWAE), which directly matches the latent and data distributions using the variational autoencoding scheme. Instead of likelihood-based objectives, GWAE models minimize the Gromov-Wasserstein (GW) metric between a trainable prior and the given data distribution. The GW metric measures the discrepancy between the distance structures of two distributions, even when they live in spaces of different dimensionality, and therefore provides a direct measure between the latent and data spaces. By restricting the prior family, we can introduce meta-priors into the latent space without changing the objective. Empirical comparisons with VAE-based models show that GWAE models work for two prominent meta-priors, disentanglement and clustering, with the GW objective unchanged.
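To make the idea of a distance-structure discrepancy across different dimensionalities concrete, here is a minimal NumPy sketch. It is not the authors' implementation. It assumes a squared-difference cost and only evaluates the GW objective for a fixed coupling, whereas the GW metric itself takes the infimum over all couplings. The names `pairwise_dist` and `gw_cost` are hypothetical helpers introduced for illustration.

```python
import numpy as np

def pairwise_dist(points):
    """Euclidean distance matrix between all rows of `points`."""
    diff = points[:, None, :] - points[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def gw_cost(dist_x, dist_z, coupling):
    """GW objective for one fixed coupling (assumed squared-difference cost):
    sum over i,j,k,l of (dist_x[i,k] - dist_z[j,l])^2 * coupling[i,j] * coupling[k,l].
    The GW metric would minimize this over all valid couplings."""
    gap = (dist_x[:, None, :, None] - dist_z[None, :, None, :]) ** 2
    return float(np.einsum("ijkl,ij,kl->", gap, coupling, coupling))

rng = np.random.default_rng(0)
n = 5

# Hypothetical "latent" samples in 2-D.
z = rng.normal(size=(n, 2))

# Hypothetical "data" samples in 3-D: the same points embedded isometrically
# through a 3x2 matrix with orthonormal columns, so pairwise distances match.
basis, _ = np.linalg.qr(rng.normal(size=(3, 2)))
x = z @ basis.T

dist_x = pairwise_dist(x)
dist_z = pairwise_dist(z)

# Coupling that pairs each data point with its own latent point.
matched = np.eye(n) / n
# Independent (product) coupling of the two uniform marginals.
independent = np.full((n, n), 1.0 / (n * n))

cost_matched = gw_cost(dist_x, dist_z, matched)
cost_independent = gw_cost(dist_x, dist_z, independent)
print(cost_matched, cost_independent)
```

Because the two point clouds share the same pairwise distances, the matched coupling drives the objective to zero up to floating-point error even though the spaces have different dimensions, while the independent coupling leaves a positive cost. The objective never compares coordinates across the two spaces, only distances within each space.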
Feb-24-2023
- Country:
  - Africa > Togo (0.04)
  - North America
    - United States > California
      - Santa Clara County > Palo Alto (0.04)
    - Canada > Ontario
      - Toronto (0.14)
  - Europe > United Kingdom
    - England > Cambridgeshire > Cambridge (0.04)
  - Asia > Japan
    - Hokkaidō (0.04)
- Genre:
  - Research Report (0.82)
- Technology: