Domain Generalization with Small Data
Chen, Kecheng, Gal, Elena, Yan, Hong, Li, Haoliang
–arXiv.org Artificial Intelligence
In this work, we propose to tackle the problem of domain generalization in the context of \textit{insufficient samples}. Instead of extracting latent feature embeddings based on deterministic models, we propose to learn a domain-invariant representation based on the probabilistic framework by mapping each data point into probabilistic embeddings. Specifically, we first extend empirical maximum mean discrepancy (MMD) to a novel probabilistic MMD that can measure the discrepancy between mixture distributions (i.e., source domains) consisting of a series of latent distributions rather than latent points. Moreover, instead of imposing the contrastive semantic alignment (CSA) loss based on pairs of latent points, a novel probabilistic CSA loss encourages positive probabilistic embedding pairs to be closer while pulling other negative ones apart. Benefiting from the learned representation captured by probabilistic models, our proposed method can marriage the measurement on the \textit{distribution over distributions} (i.e., the global perspective alignment) and the distribution-based contrastive semantic alignment (i.e., the local perspective alignment). Extensive experimental results on three challenging medical datasets show the effectiveness of our proposed method in the context of insufficient data compared with state-of-the-art methods.
arXiv.org Artificial Intelligence
Feb-8-2024
- Country:
- Asia
- Europe > United Kingdom
- England > Oxfordshire > Oxford (0.04)
- Genre:
- Research Report (0.70)
- Industry:
- Health & Medicine > Therapeutic Area (0.94)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Neural Networks > Deep Learning (0.46)
- Performance Analysis > Accuracy (0.46)
- Statistical Learning (0.93)
- Representation & Reasoning > Uncertainty (1.00)
- Vision (1.00)
- Machine Learning
- Information Technology > Artificial Intelligence