EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language Models

Ou, Yixin, Zhang, Ningyu, Gui, Honghao, Xu, Ziwen, Qiao, Shuofei, Xue, Yida, Fang, Runnan, Liu, Kangwei, Li, Lei, Bi, Zhen, Zheng, Guozhou, Chen, Huajun

Feb-5-2024–arXiv.org Artificial Intelligence

In recent years, instruction tuning has gained increasing attention and emerged as a crucial technique to enhance the capabilities of Large Language Models (LLMs). To construct high-quality instruction datasets, many instruction processing approaches have been proposed, aiming to achieve a delicate balance between data quantity and data quality. Nevertheless, due to inconsistencies that persist among various instruction processing methods, there is no standard open-source instruction processing implementation framework available for the community, which hinders practitioners from further developing and advancing. To facilitate instruction processing research and development, we present EasyInstruct, an easy-to-use instruction processing framework for LLMs, which modularizes instruction generation, selection, and prompting, while also considering their combination and interaction. EasyInstruct is publicly released and actively maintained at https://github.com/zjunlp/EasyInstruct, along with a running demo App at https://huggingface.co/spaces/zjunlp/EasyInstruct for quick-start, calling for broader research centered on instruction data.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

Feb-5-2024

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East
  - UAE (0.14)
- Europe (0.46)
- North America > United States
  - Hawaii (0.14)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.71)
  - Natural Language > Large Language Model (1.00)