Chart Question Answering from Real-World Analytical Narratives
Hutchinson, Maeve, Jianu, Radu, Slingsby, Aidan, Wood, Jo, Madhyastha, Pranava
–arXiv.org Artificial Intelligence
We present a new dataset for chart question answering (CQA) constructed from visualization notebooks. The dataset features real-world, multi-view charts paired with natural language questions grounded in analytical narratives. Unlike prior benchmarks, our data reflects ecologically valid reasoning workflows. Benchmarking state-of-the-art multimodal large language models reveals a significant performance gap, with GPT-4.1 achieving an accuracy of 69.3%, underscoring the challenges posed by this more authentic CQA setting.
arXiv.org Artificial Intelligence
Jul-3-2025
- Country:
- Asia (0.69)
- Europe (0.97)
- North America > United States (0.68)
- Genre:
- Research Report (0.82)
- Industry:
- Education (0.30)
- Technology: