Multimodal Generative Models for Scalable Weakly-Supervised Learning

Feb-14-2020, 17:11:08 GMT–Neural Information Processing Systems

Learning a joint representation of these modalities should yield deeper and more useful representations. Previous generative approaches to multi-modal input either do not learn a joint distribution or require additional computation to handle missing data. Here, we introduce a multimodal variational autoencoder (MVAE) that uses a product-of-experts inference network and a sub-sampled training paradigm to solve the multi-modal inference problem. Notably, our model shares parameters to efficiently learn under any combination of missing modalities. We apply the MVAE on four datasets and match state-of-the-art performance using many fewer parameters. In addition, we show that the MVAE is directly applicable to weakly-supervised learning, and is robust to incomplete supervision.
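
To illustrate why a product-of-experts inference network can cope with any combination of missing modalities, the following is a minimal sketch, not the authors' implementation. It assumes each modality's inference network outputs a one-dimensional Gaussian and that a standard-normal prior acts as an additional expert; the abstract specifies neither, and the name `product_of_experts` is hypothetical. Under these assumptions the product of Gaussians is again Gaussian, with precisions that add, so a missing modality is handled by leaving its expert out.

```python
from typing import Optional, Sequence, Tuple

Gaussian = Tuple[float, float]  # (mean, variance) of a 1-D Gaussian expert


def product_of_experts(
    experts: Sequence[Optional[Gaussian]],
    prior: Gaussian = (0.0, 1.0),  # assumed standard-normal prior expert
) -> Gaussian:
    """Combine 1-D Gaussian experts; None marks a missing modality."""
    prior_mean, prior_var = prior
    # Precision (inverse variance) is additive under a product of Gaussians.
    total_precision = 1.0 / prior_var
    weighted_mean_sum = prior_mean / prior_var
    for expert in experts:
        if expert is None:
            continue  # a missing modality contributes no expert
        mean, var = expert
        total_precision += 1.0 / var
        weighted_mean_sum += mean / var
    joint_var = 1.0 / total_precision
    # The joint mean is the precision-weighted average of the expert means.
    return weighted_mean_sum * joint_var, joint_var


# Hypothetical usage: the first modality is observed, the second is missing.
joint_mean, joint_var = product_of_experts([(2.0, 1.0), None])
```

Because the same function covers every subset of observed modalities, no separate inference network is needed per combination, which is in keeping with the parameter sharing the abstract describes.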

modality, multimodal generative model, scalable weakly-supervised learning, (2 more...)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Inductive Learning (0.65)
  - Natural Language > Generation (0.40)