ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
Ahmed Masry, Megh Thakkar, Aayush Bajaj, Aaryaman Kartha, Enamul Hoque, Shafiq Joty
arXiv.org Artificial Intelligence
Given the ubiquity of charts as a data analysis, visualization, and decision-making tool across industries and sciences, there has been growing interest in developing pre-trained foundation models as well as general-purpose instruction-tuned models for chart understanding and reasoning. However, existing methods suffer from crucial drawbacks along two axes that affect the performance of chart representation models: they are trained on data generated from the charts' underlying data tables, ignoring the visual trends and patterns in chart images, and they use weakly aligned vision-language backbone models for domain-specific training, limiting their generalizability when encountering charts in the wild. We address these drawbacks and introduce ChartGemma, a novel chart understanding and reasoning model developed over PaliGemma. Rather than relying on underlying data tables, ChartGemma is trained on instruction-tuning data generated directly from chart images, thus capturing both high-level trends and low-level visual information from a diverse set of charts. Our simple approach achieves state-of-the-art results across 5 benchmarks spanning chart summarization, question answering, and fact-checking, and our elaborate qualitative studies on real-world charts show that ChartGemma generates more realistic and factually correct summaries compared to its contemporaries. We release the code, model checkpoints, dataset, and demos at https://github.com/vis-nlp/ChartGemma.
Jul-4-2024
- Country:
- Asia
- Japan (0.04)
- Middle East > Republic of Türkiye (0.04)
- Singapore (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Croatia (0.04)
- Greece (0.04)
- Hungary (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy > Trentino-Alto Adige/Südtirol
- Trentino Province > Trento (0.04)
- United Kingdom
- North America
- Canada > Quebec (0.04)
- United States
- California
- San Francisco County > San Francisco (0.04)
- Santa Clara County > Palo Alto (0.04)
- Pennsylvania (0.04)
- Pacific Ocean > North Pacific Ocean
- San Francisco Bay (0.04)
- Genre:
- Research Report (0.82)
- Industry:
- Banking & Finance (1.00)
- Health & Medicine > Therapeutic Area
- Immunology (1.00)
- Infections and Infectious Diseases (0.93)
- Technology: