Human Preferences for Constructive Interactions in Language Model Alignment

Kyrychenko, Yara, Roozenbeek, Jon, Davidson, Brandon, van der Linden, Sander, Debnath, Ramit

Mar-5-2025–arXiv.org Artificial Intelligence

As large language models (LLMs) enter the mainstream, aligning them to foster constructive dialogue rather than exacerbate societal divisions is critical. Using an individualized and multicultural alignment dataset of over 7,500 conversations of individuals from 74 countries engaging with 21 LLMs, we examined how linguistic attributes linked to constructive interactions are reflected in human preference data used for training AI. We found that users consistently preferred well-reasoned and nuanced responses while rejecting those high in personal storytelling. However, users who believed that AI should reflect their values tended to place less preference on reasoning in LLM responses and more on curiosity. Encouragingly, we observed that users could set the tone for how constructive their conversation would be, as LLMs mirrored linguistic attributes, including toxicity, in user queries.

controversy, inf 1, llm response, (11 more...)

arXiv.org Artificial Intelligence

Mar-5-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Minnesota > Hennepin County > Minneapolis (0.14)
- Europe > United Kingdom
  - England
    - Cambridgeshire > Cambridge (0.29)
    - Greater London > London (0.04)

Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found