Robust Fine-tuning of Zero-shot Models via Variance Reduction

Neural Information Processing Systems 

When optimized for OOD accuracy, the ensemble model exhibits a noticeable decline in ID accuracy, and vice versa. In contrast, we propose a sample-wise ensembling technique that can simultaneously attain the best ID and OOD accuracy without the trade-offs.