Stay on topic with Classifier-Free Guidance
Sanchez, Guillaume, Fan, Honglu, Spangher, Alexander, Levi, Elad, Ammanamanchi, Pawan Sasanka, Biderman, Stella
–arXiv.org Artificial Intelligence
Classifier-Free Guidance (CFG) [37] has recently emerged in text-to-image generation as a lightweight technique to encourage prompt-adherence in generations. In this work, we demonstrate that CFG can be used broadly as an inference-time technique in pure language modeling. We show that CFG (1) improves the performance of Pythia, GPT-2 and LLaMA-family models across an array of tasks: Q&A, reasoning, code generation, and machine translation, achieving SOTA on LAMBADA with LLaMA-7B over PaLM-540B; (2) brings improvements equivalent to a model with twice the parameter-count; (3) can stack alongside other inference-time methods like Chain-of-Thought and Self-Consistency, yielding further improvements in difficult tasks; (4) can be used to increase the faithfulness and coherence of assistants in challenging form-driven and content-driven prompts: in a human evaluation we show a 75% preference for GPT4All using CFG over baseline.
arXiv.org Artificial Intelligence
Jun-30-2023
- Country:
- Asia (1.00)
- Europe (1.00)
- North America > United States
- Minnesota > Hennepin County > Minneapolis (0.14)
- Genre:
- Personal (0.92)
- Research Report
- Experimental Study (0.67)
- New Finding (1.00)
- Industry:
- Education (0.92)
- Government > Regional Government (0.67)
- Health & Medicine
- Epidemiology (0.93)
- Therapeutic Area
- Immunology (1.00)
- Infections and Infectious Diseases (1.00)
- Information Technology (0.67)
- Leisure & Entertainment > Sports (1.00)
- Media
- Technology: