InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models

Chen, Lichang, Chen, Jiuhai, Goldstein, Tom, Huang, Heng, Zhou, Tianyi

Aug-8-2023–arXiv.org Artificial Intelligence

Large language models~(LLMs) are instruction followers, but it can be challenging to find the best instruction for different situations, especially for black-box LLMs on which backpropagation is forbidden. Instead of directly optimizing the discrete instruction, we optimize a low-dimensional soft prompt applied to an open-source LLM to generate the instruction for the black-box LLM. On each iteration of the proposed method, which we call InstructZero, a soft prompt is converted into an instruction using the open-source LLM, which is then submitted to the black-box LLM for zero-shot evaluation, and the performance is sent to Bayesian optimization to produce new soft prompts improving the zero-shot performance. We evaluate InstructZero on different combinations of open-source LLMs and APIs including Vicuna and ChatGPT. Our results show that InstructZero outperforms SOTA auto-instruction methods across a variety of downstream tasks. Our code and data are publicly available at https://github.com/Lichang-Chen/InstructZero.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Aug-8-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > Maryland (0.04)
  - Dominican Republic (0.04)
- Africa
  - Togo (0.04)
  - Eritrea (0.04)
  - Burundi (0.04)

Genre:
- Research Report > New Finding (0.86)

Industry:
- Transportation > Air (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found