Memory-Efficient Fine-Tuning of Transformers via Token Selection

Open in new window