RE-Adapt: Reverse Engineered Adaptation of Large Language Models
Fleshman, William, Van Durme, Benjamin
–arXiv.org Artificial Intelligence
We introduce RE-Adapt, an approach to fine-tuning large language models on new domains without degrading any pre-existing instruction-tuning. We reverse engineer an adapter which isolates what an instruction-tuned model has learned beyond its corresponding pretrained base model. Importantly, this requires no additional data or training. We can then fine-tune the base model on a new domain and readapt it to instruction following with the reverse engineered adapter. RE-Adapt and our low-rank variant LoRE-Adapt both outperform other methods of fine-tuning, across multiple popular LLMs and datasets, even when the models are used in conjunction with retrieval-augmented generation.
arXiv.org Artificial Intelligence
May-23-2024
- Country:
- Asia > Middle East
- UAE (0.14)
- Europe (1.00)
- North America (0.68)
- Asia > Middle East
- Genre:
- Research Report > New Finding (0.93)
- Technology: