LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Neural Information Processing Systems 

Y et, their massive memory consumption has become a significant roadblock to large-scale training.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found