Growing a Tail: Increasing Output Diversity in Large Language Models
Shur-Ofry, Michal, Horowitz-Amsalem, Bar, Rahamim, Adir, Belinkov, Yonatan
–arXiv.org Artificial Intelligence
For large groups, use the name of the group or consortium and include a full list of the authors and affiliations at the end of the main manuscript or in the Supplementary Materials. Abstract: How diverse are the outputs of large language models when diversity is desired? We examine the diversity of responses of various models to questions with multiple possible answers, comparing them with human responses. Our findings suggest that models' outputs are highly concentrated, reflecting a narrow, mainstream'worldview', in comparison to humans, whose responses exhibit a much longer-tail. We examine three ways to increase models' output diversity: 1) increasing generation randomness via temperature sampling; 2) prompting models to answer from diverse perspectives; 3) aggregating outputs from several models. A combination of these measures significantly increases models' output diversity, reaching that of humans. We discuss implications of these findings for AI policy that wishes to preserve cultural diversity, an essential building block of a democratic social fabric. Conversely, a lack of diversity can result in extremism and exclusion (e.g., 1, 2).
arXiv.org Artificial Intelligence
Nov-5-2024
- Country:
- South America
- Peru > Cusco Department
- Cusco Province > Cusco (0.04)
- Chile > Santiago Metropolitan Region
- Santiago Province > Santiago (0.04)
- Brazil > Rio de Janeiro
- Rio de Janeiro (0.04)
- Argentina > Patagonia
- Tierra del Fuego Province > Ushuaia (0.04)
- Peru > Cusco Department
- Oceania
- New Zealand (0.04)
- Australia (0.04)
- North America
- Canada (0.04)
- United States
- New York (0.05)
- Indiana (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- San Francisco County > San Francisco (0.04)
- San Diego County > San Diego (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- Europe
- Western Europe (0.04)
- Czechia > Prague (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Switzerland > Zürich
- Zürich (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Hungary > Budapest
- Budapest (0.04)
- Austria > Salzburg
- Salzburg (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.05)
- Asia
- Singapore (0.04)
- Thailand
- Bangkok > Bangkok (0.04)
- Chiang Mai > Chiang Mai (0.04)
- Middle East
- Republic of Türkiye > Istanbul Province
- Istanbul (0.05)
- Israel
- Jerusalem District > Jerusalem (0.04)
- Haifa District > Haifa (0.04)
- Republic of Türkiye > Istanbul Province
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture
- Tokyo (0.05)
- Kansai > Kyoto Prefecture
- Kyoto (0.04)
- Kantō > Tokyo Metropolis Prefecture
- India > NCT
- New Delhi (0.04)
- China
- Africa
- South Africa > Western Cape
- Cape Town (0.04)
- Middle East > Egypt
- Cairo Governorate > Cairo (0.04)
- South Africa > Western Cape
- South America
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Leisure & Entertainment (1.00)
- Government (1.00)
- Media > Television (0.96)
- Technology: