LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Zheng, Yaowei, Zhang, Richong, Zhang, Junhao, Ye, Yanhan, Luo, Zheyan, Feng, Zhangchi, Ma, Yongqiang

arXiv.org Artificial Intelligence 

Large language models (LLMs) (Zhao et al., 2023) We minimize the dependencies of these modules present remarkable reasoning capabilities and empower on specific models and datasets, allowing the framework a wide range of applications, such as question to flexibly scale to hundreds of models and answering (Jiang et al., 2023b), machine translation datasets. Concretely, we first establish a model registry (Wang et al., 2023c; Jiao et al., 2023a), and where the Model Loader can precisely attach information extraction (Jiao et al., 2023b). Subsequently, adapters to the pre-trained models by identifying a substantial number of LLMs are developed exact layers. Then we develop a data description and accessible through open-source communities.