Reviews: Multimodal Generative Models for Scalable Weakly-Supervised Learning

Oct-7-2024, 05:11:30 GMT–Neural Information Processing Systems

This paper presents a generative approach to multimodal deep learning based on a product-of-experts (PoE) inference network. The main idea is to assume the joint distribution over all modalities factorises into a product of single-modality data-generating distributions when conditioned on the latent space, and use this to derive the structure and factorisation of the variational posterior. The proposed model shares parameters to efficiently handle any combination of missing modalities, and experiments indicate the model's efficacy on various benchmark datasets. The idea is intuitive, the exposition is well-written and easy to follow, and the results are thorough and compelling. I have a few questions / comments, mainly about the relationship of this work with respect to previous approaches ([15] and [21] in the text).

modality, multimodal generative model, scalable weakly-supervised learning, (11 more...)

Neural Information Processing Systems

Oct-7-2024, 05:11:30 GMT

Conferences Web Page

Add feedback

Genre:
- Summary/Review (0.36)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)