ChatBCG: Can AI Read Your Slide Deck?

Singh, Nikita, Balian, Rob, Martinelli, Lukas

Jul-16-2024–arXiv.org Artificial Intelligence

With the advanced vision capabilities of GPT-4o and Gemini Flash, an important question arises regarding the accuracy of these functionalities in practical business applications. Our assumption was that multimodal models are good at reading and summarizing charts. When given an image of a slide deck, they do a good job of summarizing key insights from it, often including relevant data points. Existing research into this question has evaluated the efficacy of LLM's when parsing tables [3], concluding that the LLMs were highly sensitive to input prompts which drive performance. Other works also evaluate LLMs ability to reason and read mathematical graphs [2] and find that GPT models outperform alternatives. This paper aims to explore whether multimodal models perform well on a variant of this skill - answering straightforward questions that require the models to pick out a number from a slide deck.

mean absolute error, mean absolute percentage error, slide deck, (13 more...)

arXiv.org Artificial Intelligence

Jul-16-2024

arXiv.org PDF

Add feedback

Country:
- Europe > France (0.04)
- South America > Brazil (0.04)
- North America > United States (0.04)
- Africa (0.04)
- Asia
  - Japan (0.04)
  - Middle East > Israel (0.04)
  - India (0.04)

Genre:
- Research Report (0.83)
- Questionnaire & Opinion Survey (0.70)

Industry:
- Banking & Finance (1.00)
- Health & Medicine > Therapeutic Area
  - Infections and Infectious Diseases (0.30)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.90)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found