EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models

He Hu, Yucheng Zhou, Lianzhong You, Hongbo Xu, Qianning Wang, Zheng Lian, Fei Richard Yu, Fei Ma, Laizhong Cui

Feb-6-2025–arXiv.org Artificial Intelligence

With the integration of multimodal large language models (MLLMs) into robotic systems and various AI applications, embedding emotional intelligence (EI) capabilities into these models is essential for enabling robots to effectively address human emotional needs and interact seamlessly in real-world scenarios. Existing benchmarks, which are static, text-based, or limited to text-image pairs, fail to capture the dynamic, multimodal nature of emotional expression in real-world interactions, making them inadequate for evaluating MLLMs' EI. Based on established psychological theories of EI, we build EmoBench-M, a novel benchmark designed to evaluate the EI capability of MLLMs across 13 evaluation scenarios spanning three key dimensions: foundational emotion recognition, conversational emotion understanding, and socially complex emotion analysis. Evaluations of both open-source and closed-source MLLMs on EmoBench-M reveal a significant performance gap between these models and humans, highlighting the need to further advance their EI capabilities. All benchmark resources, including code and datasets, are publicly available at https://emo-gml.github.io/.

large language model, machine learning, natural language

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- Oceania
  - Australia (0.04)
  - New Zealand > North Island
    - Auckland Region > Auckland (0.04)
- North America
  - United States
    - Washington > King County
      - Seattle (0.04)
    - Florida > Miami-Dade County
      - Miami (0.04)
  - Mexico > Mexico City
    - Mexico City (0.04)
  - Canada > Ontario
    - National Capital Region > Ottawa (0.04)
- Europe
  - Austria > Vienna (0.14)
  - Italy > Lombardy
    - Milan (0.04)
- Asia
  - Singapore (0.04)
  - Macao (0.04)
  - Thailand > Bangkok
    - Bangkok (0.04)
  - India > Karnataka
    - Bengaluru (0.04)
  - China
    - Hong Kong (0.04)
    - Guangdong Province > Shenzhen (0.04)
- Africa > Ethiopia
  - Addis Ababa > Addis Ababa (0.04)

Genre:
- Research Report > New Finding (0.92)

Industry:
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.61)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Cognitive Science > Emotion (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.49)
