Mapping User Trust in Vision Language Models: Research Landscape, Challenges, and Prospects
Chiatti, Agnese, Bernardini, Sara, Piccolo, Lara Shibelski Godoy, Schiaffonati, Viola, Matteucci, Matteo
–arXiv.org Artificial Intelligence
The rapid adoption of Vision Language Models (VLMs), pre-trained on large image-text and video-text datasets, calls for protecting and informing users about when to trust these systems. This survey reviews studies on trust dynamics in user-VLM interactions, through a multi-disciplinary taxonomy encompassing different cognitive science capabilities, collaboration modes, and agent behaviours. Literature insights and findings from a workshop with prospective VLM users inform preliminary requirements for future VLM trust studies.
arXiv.org Artificial Intelligence
May-9-2025
- Country:
- Europe
- Germany (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.14)
- Italy > Lombardy
- Milan (0.04)
- Europe
- Genre:
- Overview (1.00)
- Industry:
- Education (0.93)
- Health & Medicine > Therapeutic Area (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Robots (1.00)
- Representation & Reasoning (1.00)
- Natural Language > Large Language Model (0.71)
- Issues > Social & Ethical Issues (0.68)
- Machine Learning > Neural Networks
- Deep Learning (0.46)
- Information Technology > Artificial Intelligence