Instruction Following without Instruction Tuning
John Hewitt, Nelson F. Liu, Percy Liang, Christopher D. Manning
arXiv.org Artificial Intelligence
Instruction tuning commonly means finetuning a language model on instruction-response pairs. We discover two forms of adaptation (tuning) that are deficient compared to instruction tuning, yet still yield instruction following; we call this implicit instruction tuning. We first find that instruction-response pairs are not necessary: training solely on responses, without any corresponding instructions, yields instruction following. This suggests that pretrained models have an instruction-response mapping which is revealed by teaching the model the desired distribution of responses. However, we then find that it is not even necessary to teach the desired distribution of responses: instruction-response training on narrow-domain data like poetry still leads to broad instruction-following behavior like recipe generation. In particular, when instructions are very different from those in the narrow finetuning domain, models' responses do not adhere to the style of the finetuning domain. To begin to explain implicit instruction tuning, we hypothesize that very simple changes to a language model's distribution can yield instruction following. We support this by hand-writing a rule-based language model that yields instruction following when combined in a product of experts with a pretrained model. The rules are to slowly increase the probability of ending the sequence, to penalize repetition, and to uniformly change the probabilities of 15 words. In summary, adaptations made without being designed to yield instruction following can do so implicitly.

Instruction tuning, finetuning on a broad distribution of responses (e.g., Tiramisu is made by...) conditioned on instructions (e.g., Give me a recipe for tiramisu), yields instruction following from language models for a wide range of instructions (Ouyang et al., 2022). Prior work has shown that instruction tuning is sample-efficient, requiring as few as 1,000 broad-domain instruction-response pairs (Zhou et al., 2023) or a carefully crafted prompt with few-shot instruction-response examples (Lin et al., 2024). We take this a step further, exploring the idea that instruction following can be elicited from language models even implicitly, i.e., through methods not explicitly designed to do so.
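The product-of-experts construction described above can be sketched concretely. Below is a minimal decoding-time sketch, assuming a Hugging Face causal LM; the model name, rule coefficients, and the boosted word list are illustrative placeholders, not the paper's exact 15-word configuration.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "gpt2"  # placeholder; any pretrained causal LM works here
tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL)
model.eval()

def rule_log_probs(response_ids, step, vocab_size, eos_id, boosted_ids):
    """Log-probabilities of a hand-written rule-based 'expert':
    (1) slowly raise the probability of ending the sequence,
    (2) penalize tokens already generated in the response,
    (3) uniformly upweight a small fixed set of token ids."""
    logits = torch.zeros(vocab_size)
    logits[eos_id] += 0.1 * step               # rule 1: EOS grows with length
    for tok in set(response_ids):
        logits[tok] -= 1.5                     # rule 2: repetition penalty
    logits[boosted_ids] += 2.0                 # rule 3: uniform word boost
    return torch.log_softmax(logits, dim=-1)

@torch.no_grad()
def poe_generate(prompt, max_new_tokens=60, boosted_words=("Sure", "First")):
    # The boosted words here are assumptions; the paper uses 15 chosen words.
    boosted_ids = [tokenizer.encode(" " + w)[0] for w in boosted_words]
    ids = tokenizer.encode(prompt)
    prompt_len = len(ids)
    for step in range(max_new_tokens):
        base = model(torch.tensor([ids])).logits[0, -1]
        base_lp = torch.log_softmax(base, dim=-1)
        rule_lp = rule_log_probs(ids[prompt_len:], step, base.shape[-1],
                                 tokenizer.eos_token_id, boosted_ids)
        # Product of experts: multiply the two distributions
        # (i.e., add their log-probs) and decode greedily.
        next_id = int(torch.argmax(base_lp + rule_lp))
        ids.append(next_id)
        if next_id == tokenizer.eos_token_id:
            break
    return tokenizer.decode(ids)

print(poe_generate("Give me a recipe for tiramisu."))
```

Greedy decoding keeps the sketch simple; the point is that three crude, hand-written rules, multiplied into an otherwise unchanged pretrained model, are enough to illustrate the paper's hypothesis that very simple distribution changes can yield instruction following.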
Sep-21-2024