CVQA: Culturally-diverseMultilingual VisualQuestionAnsweringBenchmark

Feb-8-2026, 09:26:38 GMT–Neural Information Processing Systems

Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data.

large language model, machine learning, natural language, (22 more...)

Neural Information Processing Systems

Feb-8-2026, 09:26:38 GMT

Conferences PDF

Add feedback

Country:
- South America
  - Uruguay (0.04)
  - Ecuador (0.04)
  - Colombia (0.04)
  - Brazil (0.04)
  - Argentina (0.04)
  - Chile > Santiago Metropolitan Region
    - Santiago Province > Santiago (0.04)
- North America
  - United States > Michigan (0.05)
  - Mexico (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - France (0.04)
  - Russia (0.04)
  - Romania (0.04)
  - Norway (0.04)
  - Bulgaria (0.04)
  - Spain > Galicia
    - Madrid (0.05)
  - Germany
    - Saarland (0.04)
    - Baden-Württemberg > Stuttgart Region
      - Stuttgart (0.04)
- Asia
  - India (0.05)
  - Philippines (0.05)
  - Singapore (0.04)
  - China (0.04)
  - Russia (0.04)
  - Japan (0.04)
  - Malaysia (0.04)
  - Mongolia (0.04)
  - Pakistan (0.04)
  - Middle East
    - UAE (0.05)
    - Israel (0.04)
  - Indonesia
    - Bali (0.04)
    - Java
      - Jakarta > Jakarta (0.04)
      - East Java > Surabaya (0.04)
- Africa
  - Nigeria (0.04)
  - Ethiopia (0.04)
  - Rwanda (0.04)
  - Middle East > Egypt (0.04)
  - Kenya (0.04)

Genre:
- Overview (0.46)
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Natural Language > Large Language Model (0.68)

Duplicate Docs Excel Report

Title
1568882ba1a50316e87852542523739c-Paper-Datasets_and_Benchmarks_Track.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found