Faster Convergence for Transformer Fine-tuning with Line Search Methods

Open in new window