BendVLM: Test-Time Debiasing of Vision-Language Embeddings Walter Gerych 1 Eileen Pan

May-25-2025, 05:03:18 GMT–Neural Information Processing Systems

Vision-language model (VLM) embeddings have been shown to encode biases present in their training data, such as societal biases that prescribe negative characteristics to members of various racial and gender identities. VLMs are being quickly adopted for a variety of tasks ranging from few-shot classification to text-guided image generation, making debiasing VLM embeddings crucial. Debiasing approaches that fine-tune the VLM often suffer from catastrophic forgetting. On the other hand, fine-tuning-free methods typically utilize a "one-size-fits-all" approach that assumes that correlation with the spurious attribute can be explained using a single linear direction across all possible inputs.

artificial intelligence, machine learning, natural language, (17 more...)

Neural Information Processing Systems

May-25-2025, 05:03:18 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.28)

Genre:
- Research Report > Experimental Study (0.46)

Industry:
- Health & Medicine > Therapeutic Area (0.46)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.46)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Neural Networks (0.48)
    - Natural Language > Text Processing (0.46)
    - Vision (1.00)
  - Sensing and Signal Processing > Image Processing (1.00)