Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models

May-27-2025, 06:02:48 GMT–Neural Information Processing Systems

Fine-tuning foundation models often compromises their robustness to distribution shifts. To remedy this, most robust fine-tuning methods aim to preserve the pre-trained features. However, not all pre-trained features are robust and those methods are largely indifferent to which ones to preserve. We propose dual risk minimization (DRM), which combines empirical risk minimization with worst-case risk minimization, to better preserve the core features of downstream tasks. In particular, we utilize core-feature descriptions generated by LLMs to induce core-based zero-shot predictions which then serve as proxies to estimate the worst-case risk.

dual risk minimization, fine-tuning zero-shot model, next-level robustness, (1 more...)

Neural Information Processing Systems

May-27-2025, 06:02:48 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.99)