Chart Question Answering from Real-World Analytical Narratives

Maeve Hutchinson, Radu Jianu, Aidan Slingsby, Jo Wood, Pranava Madhyastha

Jul-3-2025–arXiv.org Artificial Intelligence

We present a new dataset for chart question answering (CQA) constructed from visualization notebooks. The dataset features real-world, multi-view charts paired with natural language questions grounded in analytical narratives. Unlike prior benchmarks, our data reflects ecologically valid reasoning workflows. Benchmarking state-of-the-art multimodal large language models reveals a significant performance gap, with GPT-4.1 achieving an accuracy of 69.3%, underscoring the challenges posed by this more authentic CQA setting.

large language model, machine learning, question answering

Country:
- Europe (0.97)
- Asia (0.69)
- North America > United States (0.68)

Genre:
- Research Report (0.82)

Industry:
- Education (0.30)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (0.91)
    - Question Answering (0.89)
  - Machine Learning > Neural Networks
    - Deep Learning (0.38)
