Multimodal Learning with Deep Boltzmann Machines
–Neural Information Processing Systems
We propose a Deep Boltzmann Machine for learning a generative model of multimodal data. We show how to use the model to extract a meaningful representation of multimodal data. We find that the learned representation is useful for classification and information retreival tasks, and hence conforms to some notion of semantic similarity. The model defines a probability density over the space of multimodal inputs. By sampling from the conditional distributions over each data modality, it possible to create the representation even when some data modalities are missing.
Neural Information Processing Systems
Apr-6-2023, 12:33:37 GMT
- Technology: