Mind the (Language) Gap: Towards Probing Numerical and Cross-Lingual Limits of LVLMs
Gautam, Somraj, Penamakuri, Abhirama Subramanyam, Bhandari, Abhishek, Harit, Gaurav
–arXiv.org Artificial Intelligence
We introduce MMCRICBENCH-3K, a benchmark for Visual Question Answering (VQA) on cricket scorecards, designed to evaluate large vision-language models (LVLMs) on complex numerical and cross-lingual reasoning over semi-structured tabular images. MMCRICBENCH-3K comprises 1,463 synthetically generated scorecard images from ODI, T20, and Test formats, accompanied by 1,500 English QA pairs. It includes two subsets: MMCRICBENCH-E-1.5K, featuring English scorecards, and MMCRICBENCH-H-1.5K, containing visually similar Hindi scorecards, with all questions and answers kept in English to enable controlled cross-script evaluation. The task demands reasoning over structured numerical data, multi-image context, and implicit domain knowledge. Empirical results show that even state-of-the-art LVLMs, such as GPT-4o and Qwen2.5VL, struggle on the English subset despite it being their primary training language and exhibit a further drop in performance on the Hindi subset. This reveals key limitations in structure-aware visual text understanding, numerical reasoning, and cross-lingual generalization. The dataset is publicly available via Hugging Face at https://huggingface.co/datasets/DIALab/MMCricBench, to promote LVLM research in this direction.
arXiv.org Artificial Intelligence
Aug-27-2025
- Country:
- Africa
- South Africa (0.04)
- Zimbabwe (0.04)
- Asia
- Afghanistan (0.04)
- Bangladesh (0.04)
- China (0.04)
- India
- Gujarat (0.04)
- Karnataka > Bengaluru (0.04)
- Maharashtra > Mumbai (0.04)
- Rajasthan (0.04)
- Tamil Nadu > Chennai (0.04)
- West Bengal > Kolkata (0.04)
- Pakistan
- Islamabad Capital Territory > Islamabad (0.04)
- Punjab > Lahore Division
- Lahore (0.04)
- Sindh > Karachi Division
- Karachi (0.04)
- Sri Lanka (0.04)
- Europe
- Ireland (0.04)
- Netherlands (0.04)
- United Kingdom > England (0.04)
- Oceania
- Australia (0.04)
- New Zealand (0.04)
- Africa
- Genre:
- Research Report > New Finding (0.66)
- Industry:
- Leisure & Entertainment > Sports > Cricket (1.00)
- Technology: