PRISM: Reducing Spurious Implicit Biases in Vision-Language Models with LLM-Guided Embedding Projection

Molahasani, Mahdiyar, Motamedi, Azadeh, Greenspan, Michael, Kim, Il-Min, Etemad, Ali

Jul-15-2025–arXiv.org Artificial Intelligence

W e introduce Projection-based Reduction of Implicit Spurious bias in vision-language Models (PRISM), a new data-free and task-agnostic solution for bias mitigation in VLMs like CLIP . VLMs often inherit and amplify biases in their training data, leading to skewed predictions. PRISM is designed to debias VLMs without relying on predefined bias categories or additional external data. It operates in two stages: first, an LLM is prompted with simple class prompts to generate scene descriptions that contain spurious correlations. Next, PRISM uses our novel contrastive-style debi-asing loss to learn a projection that maps the embeddings onto a latent space that minimizes spurious correlations while preserving the alignment between image and text em-beddings. Extensive experiments demonstrate that PRISM outperforms current debiasing methods on the commonly used W aterbirds and CelebA datasets W e make our code public at: https://github.com/MahdiyarMM/

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

Jul-15-2025

arXiv.org PDF

Add feedback

Country:
- Africa > Guinea
  - Kankan Region > Kankan Prefecture > Kankan (0.04)
- North America > Canada (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.93)
  - Natural Language > Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found