GMC -- Geometric Multimodal Contrastive Representation Learning

Poklukar, Petra, Vasco, Miguel, Yin, Hang, Melo, Francisco S., Paiva, Ana, Kragic, Danica

Feb-8-2022–arXiv.org Artificial Intelligence

Learning representations of multimodal data that are both informative and robust to missing modalities at test time remains a challenging problem due to the inherent heterogeneity of data obtained from different channels. To address it, we present a novel Geometric Multimodal Contrastive (GMC) representation learning method comprised of two main components: i) a two-level architecture consisting of modality-specific base encoder, allowing to process an arbitrary number of modalities to an intermediate representation of fixed dimensionality, and a shared projection head, mapping the intermediate representations to a latent representation space; ii) a multimodal contrastive loss function that encourages the geometric alignment of the learned representations. We experimentally demonstrate that GMC representations are semantically rich and achieve state-of-the-art performance with missing modality information on three different learning problems including prediction and reinforcement learning tasks.

dataset, gmc, representation, (17 more...)

arXiv.org Artificial Intelligence

Feb-8-2022

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia (0.04)
- North America > Canada
  - Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.04)
- Europe
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
  - Italy > Tuscany
    - Florence (0.04)

Genre:
- Research Report (1.00)

Industry:
- Education (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks (0.68)
  - Reinforcement Learning (0.66)