LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Zheng, Yaowei, Zhang, Richong, Zhang, Junhao, Ye, Yanhan, Luo, Zheyan, Feng, Zhangchi, Ma, Yongqiang

Jun-27-2024–arXiv.org Artificial Intelligence

Large language models (LLMs) (Zhao et al., 2023) We minimize the dependencies of these modules present remarkable reasoning capabilities and empower on specific models and datasets, allowing the framework a wide range of applications, such as question to flexibly scale to hundreds of models and answering (Jiang et al., 2023b), machine translation datasets. Concretely, we first establish a model registry (Wang et al., 2023c; Jiao et al., 2023a), and where the Model Loader can precisely attach information extraction (Jiao et al., 2023b). Subsequently, adapters to the pre-trained models by identifying a substantial number of LLMs are developed exact layers. Then we develop a data description and accessible through open-source communities.

arxiv preprint arxiv, language model, zhang, (13 more...)

arXiv.org Artificial Intelligence

Jun-27-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > Pennsylvania (0.04)
  - Mexico > Mexico City
    - Mexico City (0.04)
- Europe
  - Austria > Vienna (0.14)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Romania > Sud - Muntenia Development Region
    - Giurgiu County > Giurgiu (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Singapore (0.04)
  - China
    - Shanghai > Shanghai (0.04)
    - Hong Kong (0.04)

Genre:
- Research Report (0.64)
- Overview (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)