ChartCitor: Multi-Agent Framework for Fine-Grained Chart Visual Attribution
Goswami, Kanika, Mathur, Puneet, Rossi, Ryan, Dernoncourt, Franck
–arXiv.org Artificial Intelligence
Large Language Models (LLMs) can perform chart question-answering tasks but often generate unverified hallucinated responses. Existing answer attribution methods struggle to ground responses in source charts due to limited visual-semantic context, complex visual-text alignment requirements, and difficulties in bounding box prediction across complex layouts. We present ChartCitor, a multi-agent framework that provides fine-grained bounding box citations by identifying supporting evidence within chart images. The system orchestrates LLM agents to perform chart-to-table extraction, answer reformulation, table augmentation, evidence retrieval through pre-filtering and re-ranking, and table-to-chart mapping. ChartCitor outperforms existing baselines across different chart types. Qualitative user studies show that ChartCitor helps increase user trust in Generative AI by providing enhanced explainability for LLM-assisted chart QA and enables professionals to be more productive.
arXiv.org Artificial Intelligence
Feb-2-2025
- Country:
- North America
- United States
- District of Columbia > Washington (0.05)
- New York > New York County
- New York City (0.04)
- Canada > Ontario
- Toronto (0.04)
- United States
- Asia
- Singapore (0.05)
- Indonesia > Bali (0.05)
- Middle East > Jordan (0.04)
- India (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- North America
- Genre:
- Research Report (0.40)
- Technology: