Understanding Linear Probing then Fine-tuning Language Models from NTK Perspective

Neural Information Processing Systems 

The two-stage fine-tuning (FT) method of linear probing (LP) followed by fine-tuning, known as LP-FT, outperforms both linear probing alone and FT alone on in-distribution (ID) as well as out-of-distribution (OOD) data.
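To make the two stages concrete, here is a minimal sketch of LP-FT on a toy regression problem. It is not the paper's experimental setup: the tiny tanh network, the synthetic data, the learning rates, and every name (`forward`, `mse`, `train`, `W0`, `v0`) are hypothetical choices made for illustration. Stage 1 (LP) trains only the linear head `v` on top of a frozen "pretrained" backbone `W`; stage 2 (FT) starts from the probed head and updates both.

```python
import math

def forward(W, v, x):
    """Return hidden features h = tanh(W x) and the prediction v . h."""
    h = [math.tanh(sum(w_i * x_i for w_i, x_i in zip(row, x))) for row in W]
    return h, sum(v_j * h_j for v_j, h_j in zip(v, h))

def mse(W, v, data):
    """Mean squared error of the model over the dataset."""
    return sum((forward(W, v, x)[1] - y) ** 2 for x, y in data) / len(data)

def train(W, v, data, lr, steps, train_backbone):
    """Full-batch gradient descent on the MSE.

    The head v is always updated; the backbone W is updated only when
    train_backbone is True. Inputs are copied, never mutated.
    """
    W = [row[:] for row in W]
    v = v[:]
    n = len(data)
    for _ in range(steps):
        gv = [0.0] * len(v)
        gW = [[0.0] * len(row) for row in W]
        for x, y in data:
            h, pred = forward(W, v, x)
            err = 2.0 * (pred - y) / n          # d(loss)/d(pred) for this sample
            for j in range(len(v)):
                gv[j] += err * h[j]             # gradient w.r.t. head weight j
                if train_backbone:
                    dpre = err * v[j] * (1.0 - h[j] ** 2)  # through tanh
                    for i in range(len(x)):
                        gW[j][i] += dpre * x[i]
        v = [v_j - lr * g for v_j, g in zip(v, gv)]
        if train_backbone:
            W = [[w - lr * g for w, g in zip(row, grow)]
                 for row, grow in zip(W, gW)]
    return W, v

# Synthetic data on a 5x5 grid (assumption: stands in for a downstream task).
data = [((a / 2, b / 2), math.sin(a / 2) + 0.5 * (b / 2))
        for a in range(-2, 3) for b in range(-2, 3)]

# Hypothetical "pretrained" backbone and a freshly initialized (zero) head.
W0 = [[0.5, -0.3], [0.1, 0.8], [-0.7, 0.2]]
v0 = [0.0, 0.0, 0.0]

# Stage 1: linear probing (backbone frozen, head trained).
W_lp, v_lp = train(W0, v0, data, lr=0.1, steps=200, train_backbone=False)

# Stage 2: fine-tuning from the probed head (backbone and head both trained).
W_ft, v_ft = train(W_lp, v_lp, data, lr=0.01, steps=200, train_backbone=True)

print("initial loss:", mse(W0, v0, data))
print("after LP:    ", mse(W_lp, v_lp, data))
print("after LP-FT: ", mse(W_ft, v_ft, data))
```

Plain FT would correspond to calling `train(W0, v0, ..., train_backbone=True)` directly from the untrained head. The only difference between FT and LP-FT in this sketch is the head that the fine-tuning stage starts from.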
