How Different AI Chatbots Behave? Benchmarking Large Language Models in Behavioral Economics Games

Xie, Yutong, Liu, Yiyao, Ma, Zhuang, Shi, Lin, Wang, Xiyuan, Yuan, Walter, Jackson, Matthew O., Mei, Qiaozhu

Dec-16-2024–arXiv.org Artificial Intelligence

The deployment of large language models (LLMs) in diverse applications requires a thorough understanding of their decision-making strategies and behavioral patterns. As a supplement to a recent study on the behavioral Turing test, this paper presents a comprehensive analysis of five leading LLM-based chatbot families as they navigate a series of behavioral economics games. By benchmarking these AI chatbots, we aim to uncover and document both common and distinct behavioral patterns across a range of scenarios. The findings provide valuable insights into the strategic preferences of each LLM, highlighting potential implications for their deployment in critical decision-making roles.

large language model, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

Dec-16-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California (0.46)
  - Michigan (0.29)

Genre:
- Research Report (0.84)

Industry:
- Leisure & Entertainment > Games (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
  - Natural Language
    - Chatbot (1.00)
    - Large Language Model (1.00)