Simple Disentanglement of Style and Content in Visual Representations
Ngweta, Lilian, Maity, Subha, Gittens, Alex, Sun, Yuekai, Yurochkin, Mikhail
–arXiv.org Artificial Intelligence
Learning visual representations with interpretable features, i.e., disentangled representations, remains a challenging problem. Existing methods demonstrate some success but are hard to apply to large-scale vision datasets like ImageNet. In this work, we propose a simple post-processing framework to disentangle content and style in learned representations from pre-trained vision models. We model the pre-trained features probabilistically as linearly entangled combinations of the latent content and style factors and develop a simple disentanglement algorithm based on the probabilistic model. We show that the method provably disentangles content and style features and verify its efficacy empirically. Our post-processed features yield significant domain generalization performance improvements when the distribution shift occurs due to style changes or style-related spurious correlations.
arXiv.org Artificial Intelligence
May-31-2023
- Country:
- Europe > Italy (0.04)
- North America > United States
- New York > Rensselaer County
- Troy (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- New York > Rensselaer County
- Asia
- Middle East > Israel
- Tel Aviv District > Tel Aviv (0.04)
- Japan > Honshū
- Tōhoku > Iwate Prefecture > Morioka (0.04)
- Middle East > Israel
- Genre:
- Research Report (1.00)
- Technology:
- Information Technology
- Data Science (1.00)
- Artificial Intelligence
- Vision (1.00)
- Natural Language (1.00)
- Machine Learning
- Neural Networks > Deep Learning (0.68)
- Statistical Learning > Regression (0.46)
- Information Technology