Growing a Tail: Increasing Output Diversity in Large Language Models

Michal Shur-Ofry, Bar Horowitz-Amsalem, Adir Rahamim, Yonatan Belinkov

arXiv.org Artificial Intelligence 

Abstract: How diverse are the outputs of large language models when diversity is desired? We examine the diversity of responses of various models to questions with multiple possible answers, comparing them with human responses. Our findings suggest that models' outputs are highly concentrated, reflecting a narrow, mainstream 'worldview', in comparison to humans, whose responses exhibit a much longer tail. We examine three ways to increase models' output diversity: 1) increasing generation randomness via temperature sampling; 2) prompting models to answer from diverse perspectives; 3) aggregating outputs from several models. A combination of these measures significantly increases models' output diversity, reaching that of humans. We discuss the implications of these findings for AI policy that seeks to preserve cultural diversity, an essential building block of a democratic social fabric. Conversely, a lack of diversity can result in extremism and exclusion (e.g., 1, 2).
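The first two measures the abstract lists are straightforward to prototype. Below is a minimal sketch, assuming Hugging Face transformers with gpt2 as a small stand-in model; the question, personas, temperature value, and sample counts are illustrative choices, not the paper's actual experimental setup. The third measure, aggregation across models, would amount to repeating the same loop over several model names and pooling the counters.

```python
from collections import Counter

from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "gpt2"  # small stand-in; the paper evaluates larger models
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)

QUESTION = "Name one famous scientist:"  # a question with many valid answers
PERSONAS = ["a historian", "a teenager", "an artist"]  # illustrative perspectives

answers = Counter()
for persona in PERSONAS:
    # Measure 2: prompt the model to answer from a particular perspective.
    prompt = f"You are {persona}. {QUESTION}"
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(
        **inputs,
        do_sample=True,           # Measure 1: sample instead of greedy decoding
        temperature=1.2,          # higher temperature -> more randomness
        max_new_tokens=5,
        num_return_sequences=10,  # several samples per persona
        pad_token_id=tokenizer.eos_token_id,
    )
    prompt_len = inputs["input_ids"].shape[1]
    for seq in outputs:
        completion = tokenizer.decode(seq[prompt_len:], skip_special_tokens=True)
        answers[completion.strip()] += 1

# A flatter distribution over more distinct answers indicates a longer tail,
# i.e., higher output diversity.
print(f"{len(answers)} distinct answers")
print(answers.most_common(5))
```

Comparing the answer distribution produced with and without the persona prompt and high temperature gives a quick, informal sense of how much each measure lengthens the tail.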