Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)

Jun-18-2026, 13:41:52 GMT–Neural Information Processing Systems

Large language models (LMs) often struggle to generate diverse, human-like creative content, raising concerns about the long-term homogenization of human thought through repeated exposure to similar outputs. Yet scalable methods for evaluating LM output diversity remain limited, especially beyond narrow tasks such as random number or name generation, or beyond repeated sampling from a single model. To address this gap, we introduce INFINITY-CHAT, a largescale dataset of 26K diverse, real-world, open-ended user queries that admit a wide range of plausible answers with no single ground truth. We introduce the first comprehensive taxonomy for characterizing the full spectrum of open-ended prompts posed to LMs, comprising 6 top-level categories (e.g., creative content generation, brainstorm & ideation) that further breaks down to 17 subcategories.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Jun-18-2026, 13:41:52 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (1.00)
- Asia (1.00)

Genre:
- Overview (0.92)
- Personal (0.67)
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Law (1.00)
- Materials (1.00)
- Banking & Finance > Economy (1.00)
- Automobiles & Trucks (1.00)
- Education > Educational Setting (0.92)
- Media (0.92)
- Food & Agriculture (0.67)
- Energy > Renewable (0.67)
- Government > Regional Government
  - North America Government > United States Government (0.67)
- Leisure & Entertainment > Sports
  - Motorsports (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found