CoMIR: Contrastive Multimodal Image Representation for Registration

Oct-11-2024, 11:49:37 GMT–Neural Information Processing Systems

We propose contrastive coding to learn shared, dense image representations, referred to as CoMIRs (Contrastive Multimodal Image Representations). CoMIRs enable the registration of multimodal images where existing registration methods often fail due to a lack of sufficiently similar image structures. CoMIRs reduce the multimodal registration problem to a monomodal one, in which general intensity-based, as well as feature-based, registration algorithms can be applied. The method involves training one neural network per modality on aligned images, using a contrastive loss based on noise-contrastive estimation (InfoNCE). Unlike other contrastive coding methods, used for, e.g., classification, our approach generates image-like representations that contain the information shared between modalities.

comir, contrastive multimodal image representation, representation, (6 more...)

Neural Information Processing Systems

Oct-11-2024, 11:49:37 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (0.87)
  - Artificial Intelligence > Machine Learning (0.58)