Assessing Color Vision Test in Large Vision-language Models

Ye, Hongfei, Chen, Bin, Liu, Wenxi, Zhang, Yu, Li, Zhao, Ni, Dandan, Chen, Hongyang

Jul-16-2025–arXiv.org Artificial Intelligence

With the widespread adoption of large vision-language models, the capacity for color vision in these models is crucial. However, the color vision abilities of large visual-language models have not yet been thoroughly explored. To address this gap, we define a color vision testing task for large vision-language models and construct a dataset \footnote{Anonymous Github Showing some of the data https://anonymous.4open.science/r/color-vision-test-dataset-3BCD} that covers multiple categories of test questions and tasks of varying difficulty levels. Furthermore, we analyze the types of errors made by large vision-language models and propose fine-tuning strategies to enhance their performance in color vision tests.

category, large language model, machine learning, (13 more...)

arXiv.org Artificial Intelligence

Jul-16-2025

arXiv.org PDF

Add feedback

Country:
- Asia > China (0.30)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (0.98)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language > Large Language Model (0.98)
  - Machine Learning > Neural Networks
    - Deep Learning (0.49)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found