Mapping User Trust in Vision Language Models: Research Landscape, Challenges, and Prospects

Chiatti, Agnese; Bernardini, Sara; Piccolo, Lara Shibelski Godoy; Schiaffonati, Viola; Matteucci, Matteo

May-9-2025–arXiv.org Artificial Intelligence

The rapid adoption of Vision Language Models (VLMs), pre-trained on large image-text and video-text datasets, calls for protecting users and informing them about when to trust these systems. This survey reviews studies on trust dynamics in user-VLM interactions through a multi-disciplinary taxonomy encompassing different cognitive science capabilities, collaboration modes, and agent behaviours. Literature insights and findings from a workshop with prospective VLM users inform preliminary requirements for future VLM trust studies.

large language model, machine learning, natural language

Country:
- Europe
  - Germany (0.04)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.14)
  - Italy > Lombardy
    - Milan (0.04)

Genre:
- Overview (1.00)

Industry:
- Education (0.93)
- Health & Medicine > Therapeutic Area (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Robots (1.00)
  - Representation & Reasoning (1.00)
  - Natural Language > Large Language Model (0.71)
  - Issues > Social & Ethical Issues (0.68)
  - Machine Learning > Neural Networks
    - Deep Learning (0.46)
