Joint Embedding Variational Bayes
We introduce Variational Joint Embedding (VJE), a framework that synthesizes joint embedding and variational inference to enable self-supervised learning of probabilistic representations in a reconstruction-free, non-contrastive setting. Compared to energy-based predictive objectives that optimize pointwise discrepancies, VJE maximizes a symmetric conditional evidence lower bound (ELBO) for a latent-variable model defined directly on encoder embeddings. We instantiate the conditional likelihood with a heavy-tailed Student-$t$ model using a polar decomposition that explicitly decouples directional and radial factors to prevent norm-induced instabilities during training. VJE employs an amortized inference network to parameterize a diagonal Gaussian variational posterior whose feature-wise variances are shared with the likelihood scale to capture anisotropic uncertainty without auxiliary projection heads. Across ImageNet-1K, CIFAR-10/100, and STL-10, VJE achieves performance comparable to standard non-contrastive baselines under linear and k-NN evaluation. We further validate these probabilistic semantics through one-class CIFAR-10 anomaly detection, where likelihood-based scoring under the proposed model outperforms comparable self-supervised baselines.
Feb-6-2026
- Country:
- North America
- United States
- New York (0.04)
- California > Santa Clara County
- Palo Alto (0.04)
- Canada > Ontario
- Waterloo Region > Waterloo (0.04)
- Toronto (0.04)
- United States
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine > Diagnostic Medicine > Imaging (0.46)
- Technology: