CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis
Baumann, Alexander, Ayala, Leonardo, Seidlitz, Silvia, Sellner, Jan, Studier-Fischer, Alexander, Özdemir, Berkin, Maier-Hein, Lena, Ilic, Slobodan
–arXiv.org Artificial Intelligence
Spectral imaging offers promising applications across diverse domains, including medicine and urban scene understanding, and is already established as a critical modality in remote sensing. However, variability in channel dimensionality and captured wavelengths among spectral cameras impede the development of AI-driven methodologies, leading to camera-specific models with limited generalizability and inadequate cross-camera applicability. To address this bottleneck, we introduce CARL, a model for Camera-Agnostic Representation Learning across RGB, multispectral, and hyperspectral imaging modalities. To enable the conversion of a spectral image with any channel dimensionality to a camera-agnostic representation, we introduce a novel spectral encoder, featuring a self-attention-cross-attention mechanism, to distill salient spectral information into learned spectral representations. Spatio-spectral pre-training is achieved with a novel feature-based self-supervision strategy tailored to CARL. Large-scale experiments across the domains of medical imaging, autonomous driving, and satellite imaging demonstrate our model's unique robustness to spectral heterogeneity, outperforming on datasets with simulated and real-world cross-camera spectral variations. The scalability and versatility of the proposed approach position our model as a backbone for future spectral foundation models.
arXiv.org Artificial Intelligence
Sep-29-2025
- Country:
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
- Genre:
- Research Report (0.64)
- Industry:
- Technology: