Implicit variance regularization in non-contrastive SSL

Jan-19-2025, 21:58:40 GMT–Neural Information Processing Systems

Non-contrastive self-supervised learning (SSL) methods like BYOL and SimSiam rely on asymmetric predictor networks to avoid representational collapse without negative samples. Yet how predictor networks facilitate stable learning is not fully understood. While previous theoretical analyses assumed Euclidean losses, most practical implementations rely on cosine similarity. To gain further theoretical insight into non-contrastive SSL, we analytically study learning dynamics in conjunction with Euclidean and cosine similarity in the eigenspace of closed-form linear predictor networks. We show that both avoid collapse through implicit variance regularization, albeit through different dynamical mechanisms.
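
For concreteness, the two losses being contrasted can be sketched as below. This is a minimal illustration and not the paper's code: the function names, the toy embeddings, and the use of a plain matrix `W` as the linear predictor are assumptions made here. The stop-gradient on the target branch used by BYOL and SimSiam appears only as a comment, since the sketch computes no gradients.

```python
import math

def apply_linear_predictor(W, z):
    """Apply a linear predictor (a plain matrix W, hypothetical) to embedding z."""
    return [sum(w * x for w, x in zip(row, z)) for row in W]

def euclidean_loss(prediction, target):
    """Half the squared Euclidean distance between prediction and target."""
    return 0.5 * sum((p - t) ** 2 for p, t in zip(prediction, target))

def cosine_loss(prediction, target):
    """Negative cosine similarity; depends on direction only, not on norm."""
    dot = sum(p * t for p, t in zip(prediction, target))
    norm_p = math.sqrt(sum(p * p for p in prediction))
    norm_t = math.sqrt(sum(t * t for t in target))
    return -dot / (norm_p * norm_t)

# Toy embeddings of two augmented views of one input (made-up values).
z1 = [1.0, 2.0]
z2 = [2.0, 1.0]  # target branch; treated as a constant (stop-gradient) in training
W = [[1.0, 0.0], [0.0, 1.0]]  # identity predictor, chosen for simplicity

pred = apply_linear_predictor(W, z1)
loss_euc = euclidean_loss(pred, z2)
loss_cos = cosine_loss(pred, z2)

# Rescale the prediction to compare how each loss reacts to the norm.
scaled = [3.0 * p for p in pred]
loss_euc_scaled = euclidean_loss(scaled, z2)
loss_cos_scaled = cosine_loss(scaled, z2)
```

Rescaling the prediction changes the Euclidean loss but leaves the cosine loss unchanged, which is one plausible reason the two losses can follow different learning dynamics.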

implicit variance regularization, non-contrastive ssl, variance regularization

Technology:
- Information Technology > Artificial Intelligence (0.87)