LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Zheng, Yaowei, Zhang, Richong, Zhang, Junhao, Ye, Yanhan, Luo, Zheyan, Feng, Zhangchi, Ma, Yongqiang
–arXiv.org Artificial Intelligence
Large language models (LLMs) (Zhao et al., 2023) We minimize the dependencies of these modules present remarkable reasoning capabilities and empower on specific models and datasets, allowing the framework a wide range of applications, such as question to flexibly scale to hundreds of models and answering (Jiang et al., 2023b), machine translation datasets. Concretely, we first establish a model registry (Wang et al., 2023c; Jiao et al., 2023a), and where the Model Loader can precisely attach information extraction (Jiao et al., 2023b). Subsequently, adapters to the pre-trained models by identifying a substantial number of LLMs are developed exact layers. Then we develop a data description and accessible through open-source communities.
arXiv.org Artificial Intelligence
Jun-27-2024
- Country:
- North America
- United States > Pennsylvania (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- Europe
- Austria > Vienna (0.14)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Romania > Sud - Muntenia Development Region
- Giurgiu County > Giurgiu (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- North America
- Genre:
- Research Report (0.64)
- Overview (0.46)
- Technology: