CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark David Romero
–Neural Information Processing Systems
Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data.
Neural Information Processing Systems
Nov-14-2025, 01:08:21 GMT
- Country:
- Africa
- Asia
- Europe
- Ireland (0.04)
- Romania (0.04)
- Belgium > Flanders
- Flemish Brabant > Leuven (0.04)
- Russia (0.04)
- France (0.04)
- Norway (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Germany
- Baden-Württemberg > Stuttgart Region
- Stuttgart (0.04)
- Hesse > Darmstadt Region
- Darmstadt (0.04)
- Saarland (0.04)
- Baden-Württemberg > Stuttgart Region
- Bulgaria (0.04)
- Spain > Galicia
- Madrid (0.04)
- North America
- Canada > Ontario
- Toronto (0.04)
- Central America (0.04)
- Dominican Republic (0.04)
- Mexico (0.04)
- United States > Michigan (0.04)
- Canada > Ontario
- South America
- Argentina (0.04)
- Brazil (0.04)
- Chile > Santiago Metropolitan Region
- Santiago Province > Santiago (0.04)
- Colombia (0.04)
- Ecuador (0.04)
- Uruguay (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Technology: