Forcing Diffuse Distributions out of Language Models
Zhang, Yiming; Schwarzschild, Avi; Carlini, Nicholas; Kolter, Zico; Ippolito, Daphne
arXiv.org Artificial Intelligence
Despite being trained specifically to follow user instructions, today's language models perform poorly when instructed to produce random outputs. For example, when prompted to pick a number uniformly between one and ten, Llama-2-13B-chat disproportionately favors the number five, and when tasked with picking a first name at random, Mistral-7B-Instruct chooses Avery 40 times more often than we would expect based on the U.S. population. When these language models are used for real-world tasks where diversity of outputs is crucial, such as language-model-assisted dataset construction, their inability to produce diffuse distributions over valid choices is a major hurdle. In this work, we propose a fine-tuning method that encourages language models to output distributions that are diffuse over valid outcomes. The methods we introduce generalize across a variety of tasks and distributions and make large language models practical for synthetic dataset generation with little human intervention.
Apr-16-2024
- Country:
- Africa
- Kenya > Nairobi City County
- Nairobi (0.04)
- Middle East (0.04)
- Asia
- India > Maharashtra
- Mumbai (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Middle East
- Qatar > Al Rayyan (0.04)
- Saudi Arabia > Mecca Province
- Mecca (0.04)
- Singapore (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Europe
- Croatia > Primorje-Gorski Kotar County
- Rijeka (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- France > Île-de-France
- Hungary > Budapest
- Budapest (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Liechtenstein > Vaduz
- Vaduz (0.04)
- Middle East (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- North America > United States
- California > Orange County
- Laguna Beach (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts
- Middlesex County > Cambridge (0.04)
- Suffolk County > Boston (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Oceania > Australia
- Queensland > Townsville (0.04)
- South America
- Colombia > Meta Department
- Villavicencio (0.04)
- Uruguay > Montevideo
- Montevideo (0.04)
- Genre:
- Research Report (1.00)
- Industry:
- Energy (0.68)
- Government (0.68)
- Media (0.46)
- Technology: