Contrastive Decoding: Open-ended Text Generation as Optimization
Li, Xiang Lisa; Holtzman, Ari; Fried, Daniel; Liang, Percy; Eisner, Jason; Hashimoto, Tatsunori; Zettlemoyer, Luke; Lewis, Mike
arXiv.org Artificial Intelligence
Given a language model (LM), maximum probability is a poor decoding objective for open-ended generation, because it produces short and repetitive text. On the other hand, sampling can often produce incoherent text that drifts from the original topics. We propose contrastive decoding (CD), a reliable decoding approach that optimizes a contrastive objective subject to a plausibility constraint. The contrastive objective returns the difference between the likelihood under a large LM (called the expert, e.g., OPT-13B) and a small LM (called the amateur, e.g., OPT-125M), and the constraint ensures that the outputs are plausible. CD is inspired by the fact that the failures of larger LMs (e.g., repetition, incoherence) are even more prevalent in smaller LMs, and that this difference signals which texts should be preferred. CD requires zero additional training, and produces higher quality text than decoding from the larger LM alone. It also works across model scales (OPT-13B and GPT2-1.5B) and significantly outperforms four strong decoding algorithms (e.g., nucleus, top-k) in automatic and human evaluations across Wikipedia, news, and story domains.
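To make the objective concrete, here is a minimal single-step sketch in Python. It scores each candidate next token by the expert's log-probability minus the amateur's, and discards implausible candidates first. The abstract does not spell out the form of the plausibility constraint, so the cutoff used here (keep a token only if its expert probability is at least `alpha` times the expert's most likely token) is an assumption for illustration, as are the function names, the default `alpha = 0.1`, and the toy probabilities.

```python
import math

def contrastive_scores(expert_logprobs, amateur_logprobs, alpha=0.1):
    """Score candidate tokens by expert minus amateur log-probability.

    Assumed plausibility rule: a token is kept only if its expert
    probability is at least alpha times the expert's top probability.
    Filtered tokens receive a score of -inf.
    """
    cutoff = math.log(alpha) + max(expert_logprobs.values())
    scores = {}
    for token, expert_lp in expert_logprobs.items():
        if expert_lp < cutoff:
            scores[token] = -math.inf  # implausible under the expert
        else:
            scores[token] = expert_lp - amateur_logprobs[token]
    return scores

def pick_token(expert_logprobs, amateur_logprobs, alpha=0.1):
    """Greedy choice: the plausible token with the largest contrast."""
    scores = contrastive_scores(expert_logprobs, amateur_logprobs, alpha)
    return max(scores, key=scores.get)

# Toy next-token distributions (made-up numbers, not model outputs).
expert = {"the": math.log(0.5), "Honolulu": math.log(0.3),
          "of": math.log(0.199), "zzz": math.log(0.001)}
amateur = {"the": math.log(0.6), "Honolulu": math.log(0.05),
           "of": math.log(0.34999), "zzz": math.log(0.00001)}

choice = pick_token(expert, amateur)
choice_without_constraint = pick_token(expert, amateur, alpha=1e-6)
```

In this toy case `choice` is "Honolulu": the expert rates it far higher than the amateur does, and it clears the cutoff. With the constraint effectively disabled (`alpha=1e-6`), the rare token "zzz" wins instead, because its expert-to-amateur ratio is large even though the expert itself considers it very unlikely. That is the failure the plausibility constraint is meant to prevent.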
Jul-10-2023