Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models
–Neural Information Processing Systems
Fine-tuning foundation models often compromises their robustness to distribution shifts. To remedy this, most robust fine-tuning methods aim to preserve the pre-trained features. However, not all pre-trained features are robust, and those methods are largely indifferent to which ones they preserve. We propose dual risk minimization (DRM), which combines empirical risk minimization with worst-case risk minimization to better preserve the core features of downstream tasks. In particular, we use core-feature descriptions generated by LLMs to induce core-based zero-shot predictions, which then serve as proxies for estimating the worst-case risk.
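To make the idea of combining the two risks concrete, here is a minimal sketch of one plausible reading of the abstract, not the paper's actual objective. It assumes the empirical risk is a cross-entropy on ground-truth labels and that the worst-case risk is approximated by a cross-entropy against soft targets taken from the core-based zero-shot predictions. The function name `dual_risk`, the argument `proxy_probs`, and the trade-off weight `lam` are hypothetical and introduced only for illustration.

```python
import numpy as np


def log_softmax(logits):
    # Numerically stable log-softmax over the class axis.
    shifted = logits - logits.max(axis=1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))


def dual_risk(logits, labels, proxy_probs, lam=1.0):
    """Illustrative combined objective (an assumption, not the paper's formula).

    logits:      (n, k) outputs of the model being fine-tuned
    labels:      (n,)   ground-truth class indices
    proxy_probs: (n, k) soft targets, assumed to come from core-based
                 zero-shot predictions and used as a worst-case-risk proxy
    lam:         hypothetical weight on the proxy term
    """
    logp = log_softmax(logits)
    # Empirical risk: mean cross-entropy against the true labels.
    empirical = -logp[np.arange(len(labels)), labels].mean()
    # Proxy for the worst-case risk: mean cross-entropy against soft targets.
    proxy = -(proxy_probs * logp).sum(axis=1).mean()
    return empirical + lam * proxy
```

With `lam=0` this reduces to ordinary empirical risk minimization; larger values pull the fine-tuned model toward the core-based predictions.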
May-27-2025, 06:02:48 GMT