Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor
Honovich, Or; Scialom, Thomas; Levy, Omer; Schick, Timo
arXiv.org Artificial Intelligence
Instruction tuning enables pretrained language models to perform new tasks from inference-time natural language descriptions. These approaches rely on vast amounts of human supervision in the form of crowdsourced datasets or user interactions. In this work, we introduce Unnatural Instructions: a large dataset of creative and diverse instructions, collected with virtually no human labor. We collect 64,000 examples by prompting a language model with three seed examples of instructions and eliciting a fourth. This set is then expanded by prompting the model to rephrase each instruction, creating a total of approximately 240,000 examples of instructions, inputs, and outputs. Experiments show that despite containing a fair amount of noise, training on Unnatural Instructions rivals the effectiveness of training on open-source manually-curated datasets, surpassing the performance of models such as T0++ and Tk-Instruct across various benchmarks. These results demonstrate the potential of model-generated data as a cost-effective alternative to crowdsourcing for dataset expansion and diversification.
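The collection procedure described in the abstract (three seed examples in the prompt, a fourth elicited from the model, then a rephrasing pass) can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the prompt layout, the `Example` fields, and the function names `build_prompt`, `collect`, and `expand` are all assumptions, and `generate` and `rephrase` are hypothetical stand-ins for whatever language model calls are actually used.

```python
import random
from typing import Callable, Dict, List

# Hypothetical record layout; the field names are assumptions for this sketch.
Example = Dict[str, str]  # keys: "instruction", "input"


def build_prompt(seeds: List[Example]) -> str:
    """Format three seed demonstrations and open a fourth slot for the model."""
    if len(seeds) != 3:
        raise ValueError("expected exactly three seed examples")
    parts = [
        f"Example {i}\nInstruction: {ex['instruction']}\nInput: {ex['input']}\n"
        for i, ex in enumerate(seeds, start=1)
    ]
    # The fourth example is left open so the model completes it.
    parts.append("Example 4\nInstruction:")
    return "\n".join(parts)


def collect(
    seed_pool: List[Example],
    generate: Callable[[str], str],  # hypothetical model call: prompt -> completion
    n_prompts: int,
    rng: random.Random,
) -> List[str]:
    """Sample three seeds per prompt and keep the model's fourth example."""
    results = []
    for _ in range(n_prompts):
        seeds = rng.sample(seed_pool, 3)
        results.append(generate(build_prompt(seeds)).strip())
    return results


def expand(
    instructions: List[str],
    rephrase: Callable[[str], List[str]],  # hypothetical model call: paraphrases
) -> List[str]:
    """Keep each original instruction and append its model-written rephrasings."""
    expanded = []
    for instruction in instructions:
        expanded.append(instruction)
        expanded.extend(rephrase(instruction))
    return expanded
```

Passing the model calls in as plain callables keeps the sketch independent of any particular API; any filtering of noisy generations would sit between `collect` and `expand`.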
Dec-19-2022
- Country:
  - Asia
    - Middle East
      - Israel > Tel Aviv District
        - Tel Aviv (0.04)
      - Republic of Türkiye > Istanbul Province
        - Istanbul (0.04)
    - Thailand > Pattani
      - Pattani (0.04)
  - Europe
    - Ireland > Leinster
      - County Dublin > Dublin (0.04)
    - Middle East > Republic of Türkiye
      - Istanbul Province > Istanbul (0.04)
    - Portugal > Lisbon
      - Lisbon (0.04)
  - North America
    - Dominican Republic (0.04)
    - United States
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - New York > New York County
        - New York City (0.04)
      - Texas
        - Chambers County (0.04)
        - Kleberg County (0.04)
- Genre:
  - Research Report > New Finding (0.66)
- Industry:
  - Government > Regional Government (0.46)
- Technology: