Distributionally Robust Optimization and Generalization in Kernel Methods

Oct-9-2024, 14:51:19 GMT–Neural Information Processing Systems

Distributionally robust optimization (DRO) has attracted attention in machine learning due to its connections to regularization, generalization, and robustness. Existing work has considered uncertainty sets based on phi-divergences and Wasserstein distances, each of which have drawbacks. In this paper, we study DRO with uncertainty sets measured via maximum mean discrepancy (MMD). We show that MMD DRO is roughly equivalent to regularization by the Hilbert norm and, as a byproduct, reveal deep connections to classic results in statistical learning. In particular, we obtain an alternative proof of a generalization bound for Gaussian kernel ridge regression via a DRO lense.

distributionally robust optimization and generalization, kernel method, regularization, (1 more...)

Neural Information Processing Systems

Oct-9-2024, 14:51:19 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Kernel Methods (0.47)