WhodunitBench: Evaluating Large Multimodal Agents via Murder Mystery Games

Feb-16-2026, 23:51:31 GMT–Neural Information Processing Systems

Recently, large language models (LLMs) have achieved superior performance, empowering the development of large multimodal agents (LMAs).

artificial intelligence, large language model, natural language, (17 more...)

Neural Information Processing Systems

Feb-16-2026, 23:51:31 GMT

Conferences PDF

Country:
- Europe > Iceland
  - Capital Region > Reykjavik (0.04)
- Asia > China
  - Guangdong Province > Shenzhen (0.04)
  - Hong Kong (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Law (1.00)
- Information Technology (1.00)
- Leisure & Entertainment > Games
  - Computer Games (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Representation & Reasoning > Agents (0.93)