An Empirical Investigation of Domain Generalization with Empirical Risk Minimizers
–Neural Information Processing Systems
Recent work demonstrates that deep neural networks trained using Empirical Risk Minimization (ERM) can generalize under distribution shift, outperforming specialized training algorithms for domain generalization. The goal of this paper is to further understand this phenomenon. In particular, we study the extent to which the seminal domain adaptation theory of Ben-David et al. (2007) explains the performance of ERMs. Perhaps surprisingly, we find that this theory does not provide a tight explanation of the out-of-domain generalization observed across a large number of ERM models trained on three popular domain generalization datasets. This motivates us to investigate other possible measures--that, however, lack theory--which could explain generalization in this setting.
Neural Information Processing Systems
Jan-19-2025, 12:11:18 GMT
- Technology: