The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models
Ma, Bolei, Wang, Xinpeng, Hu, Tiancheng, Haensch, Anna-Carolina, Hedderich, Michael A., Plank, Barbara, Kreuter, Frauke
–arXiv.org Artificial Intelligence
Recent advances in Large Language Models (LLMs) have sparked wide interest in validating and comprehending the human-like cognitive-behavioral traits LLMs may have. These cognitive-behavioral traits include typically Attitudes, Opinions, Values (AOV). However, measuring AOV embedded within LLMs remains opaque, and different evaluation methods may yield different results. This has led to a lack of clarity on how different studies are related to each other and how they can be interpreted. This paper aims to bridge this gap by providing an overview of recent works on the evaluation of AOV in LLMs. Moreover, we survey related approaches in different stages of the evaluation pipeline in these works. By doing so, we address the potential and challenges with respect to understanding the model, human-AI alignment, and downstream application in social sciences. Finally, we provide practical insights into evaluation methods, model enhancement, and interdisciplinary collaboration, thereby contributing to the evolving landscape of evaluating AOV in LLMs.
arXiv.org Artificial Intelligence
Jul-1-2024
- Country:
- North America
- United States
- New York > New York County
- New York City (0.04)
- New Jersey > Hudson County
- Hoboken (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Maryland > Prince George's County
- College Park (0.04)
- California > San Francisco County
- San Francisco (0.04)
- New York > New York County
- Mexico > Mexico City
- Mexico City (0.04)
- Canada > Ontario
- Toronto (0.04)
- United States
- Europe
- Austria > Vienna (0.14)
- France (0.04)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- Spain > Galicia
- Madrid (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.14)
- Buckinghamshire > Milton Keynes (0.04)
- Latvia > Lubāna Municipality
- Lubāna (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Asia
- Singapore (0.05)
- Russia (0.04)
- Middle East
- Israel (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.14)
- North America
- Genre:
- Research Report (1.00)
- Questionnaire & Opinion Survey (1.00)
- Overview (1.00)
- Industry:
- Government (0.92)
- Health & Medicine (0.67)
- Technology: