Improved Multimodal Deep Learning with Variation of Information

Sohn, Kihyuk, Shang, Wenling, Lee, Honglak

Feb-14-2020, 09:25:59 GMT–Neural Information Processing Systems

Deep learning has been successfully applied to multimodal representation learning problems, with a common strategy to learning joint representations that are shared across multiple modalities on top of layers of modality-specific networks. Nonetheless, there still remains a question how to learn a good association between data modalities; in particular, a good generative model of multimodal data should be able to reason about missing data modality given the rest of data modalities. In this paper, we propose a novel multimodal representation learning framework that explicitly aims this goal. Rather than learning with maximum likelihood, we train the model to minimize the variation of information. We provide a theoretical insight why the proposed learning objective is sufficient to estimate the data-generating joint distribution of multimodal data.

data modality, improved multimodal deep learning, representation, (4 more...)

Neural Information Processing Systems

Feb-14-2020, 09:25:59 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.79)