Task-Robust Pre-Training for Worst-Case Downstream Adaptation

Dec-24-2025, 03:42:24 GMT–Neural Information Processing Systems

Pre-training has achieved remarkable success when transferred to downstream tasks. In machine learning, we care about not only the good performance of a model but also its behavior under reasonable shifts of condition. The same philosophy holds when pre-training a foundation model. However, the foundation model may not uniformly behave well for a series of related downstream tasks. This happens, for example, when conducting mask recovery regression where the recovery ability or the training instances diverge like pattern features are extracted dominantly on pre-training, but semantic features are also required on a downstream task.

downstream task, task-robust pre-training, worst-case downstream adaptation, (5 more...)

Neural Information Processing Systems

Dec-24-2025, 03:42:24 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.59)