Faithful Chart Summarization with ChaTS-Pi

Krichene, Syrine, Piccinno, Francesco, Liu, Fangyu, Eisenschlos, Julian Martin

May-29-2024–arXiv.org Artificial Intelligence

Chart-to-summary generation can help explore data, communicate insights, and help the visually impaired people. Multi-modal generative models have been used to produce fluent summaries, but they can suffer from factual and perceptual errors. In this work we present CHATS-CRITIC, a reference-free chart summarization metric for scoring faithfulness. CHATS-CRITIC is composed of an image-to-text model to recover the table from a chart, and a tabular entailment model applied to score the summary sentence by sentence. We find that CHATS-CRITIC evaluates the summary quality according to human ratings better than reference-based metrics, either learned or n-gram based, and can be further used to fix candidate summaries by removing not supported sentences. We then introduce CHATS-PI, a chart-to-summary pipeline that leverages CHATS-CRITIC during inference to fix and rank sampled candidates from any chart-summarization model. We evaluate CHATS-PI and CHATS-CRITIC using human raters, establishing state-of-the-art results on two popular chart-to-summary datasets.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

May-29-2024

arXiv.org PDF

Add feedback

Country:
- Europe (1.00)
- North America > United States
  - Washington > King County > Seattle (0.14)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Therapeutic Area (0.70)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.68)
  - Natural Language
    - Grammars & Parsing (0.46)
    - Large Language Model (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found