Chart Question Answering from Real-World Analytical Narratives
Maeve Hutchinson, Radu Jianu, Aidan Slingsby, Jo Wood, Pranava Madhyastha
arXiv.org Artificial Intelligence
We present a new dataset for chart question answering (CQA) constructed from visualization notebooks. The dataset features real-world, multi-view charts paired with natural language questions grounded in analytical narratives. Unlike prior benchmarks, our data reflects ecologically valid reasoning workflows. Benchmarking state-of-the-art multimodal large language models reveals a significant performance gap, with GPT-4.1 achieving an accuracy of 69.3%, underscoring the challenges posed by this more authentic CQA setting.
Jul-3-2025
- Country:
  - Asia
  - Europe
    - France (0.06)
    - Ireland > Leinster
      - County Dublin > Dublin (0.04)
    - United Kingdom
      - Northern Ireland (0.04)
      - Scotland (0.04)
      - Wales (0.06)
  - North America
    - Canada > British Columbia
      - Vancouver (0.04)
    - United States
      - New Mexico > Bernalillo County
        - Albuquerque (0.04)
      - Washington > King County
        - Seattle (0.04)
- Genre:
  - Research Report (0.82)
- Industry:
  - Education (0.30)
- Technology: