VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
Hall, Siobhan Mackenzie, Abrantes, Fernanda Gonçalves, Zhu, Hanwen, Sodunke, Grace, Shtedritski, Aleksandar, Kirk, Hannah Rose
arXiv.org Artificial Intelligence
We introduce VisoGender, a novel dataset for benchmarking gender bias in vision-language models. We focus on occupation-related biases within a hegemonic system of binary gender, inspired by Winograd and Winogender schemas, where each image is associated with a caption containing a pronoun relationship of subjects and objects in the scene. VisoGender is balanced by gender representation in professional roles, supporting bias evaluation in two ways: i) resolution bias, where we evaluate the difference between pronoun resolution accuracies for image subjects with gender presentations perceived as masculine versus feminine by human annotators, and ii) retrieval bias, where we compare ratios of professionals perceived to have masculine and feminine gender presentations retrieved for a gender-neutral search query. We benchmark several state-of-the-art vision-language models and find that they demonstrate bias in resolving binary gender in complex scenes. While the direction and magnitude of gender bias depend on the task and the model being evaluated, captioning models are generally less biased than vision-language encoders. Dataset and code are available at https://github.com/oxai/visogender
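The abstract describes the two bias measures only in words, so the following Python sketch shows one simple way they could be computed. It is an illustration under stated assumptions, not the authors' implementation: the function names, the "masculine"/"feminine" label strings, and the choice of a plain accuracy difference and a top-k proportion are all hypothetical, and the paper's actual metrics may be defined differently.

```python
from typing import Sequence


def resolution_accuracy(predicted: Sequence[str],
                        perceived: Sequence[str],
                        group: str) -> float:
    """Share of images whose perceived presentation is `group` and whose
    pronoun was resolved to that same group by the model."""
    pairs = [(p, g) for p, g in zip(predicted, perceived) if g == group]
    if not pairs:
        raise ValueError(f"no images with perceived presentation {group!r}")
    return sum(p == g for p, g in pairs) / len(pairs)


def resolution_gap(predicted: Sequence[str],
                   perceived: Sequence[str]) -> float:
    """Assumed reading of 'resolution bias': masculine accuracy minus
    feminine accuracy. Zero means both groups are resolved equally well."""
    return (resolution_accuracy(predicted, perceived, "masculine")
            - resolution_accuracy(predicted, perceived, "feminine"))


def masculine_share_at_k(retrieved: Sequence[str], k: int) -> float:
    """Assumed reading of 'retrieval bias': proportion of the top-k images
    returned for a gender-neutral query that are perceived as masculine.
    On a gender-balanced pool, 0.5 would indicate no skew."""
    top = retrieved[:k]
    if not top:
        raise ValueError("k must select at least one retrieved image")
    return sum(label == "masculine" for label in top) / len(top)


# Toy data, invented purely for illustration.
perceived = ["masculine", "masculine", "feminine", "feminine"]
predicted = ["masculine", "masculine", "feminine", "masculine"]
ranking = ["masculine", "feminine", "masculine", "masculine"]

gap = resolution_gap(predicted, perceived)      # 1.0 - 0.5
share = masculine_share_at_k(ranking, k=4)      # 3 of 4 images
```

On this toy data the gap is positive, meaning the hypothetical model resolves pronouns more accurately for subjects perceived as masculine, and the top-4 retrieval is skewed toward masculine-presenting professionals.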
Dec-12-2023
- Country:
  - Asia > Middle East
    - Israel (0.14)
  - Europe > Switzerland
  - North America > United States
    - Louisiana (0.14)
- Genre:
  - Research Report > New Finding (0.67)
- Industry:
  - Government (0.67)
  - Health & Medicine (1.00)
  - Information Technology > Security & Privacy (1.00)
  - Law (1.00)
- Technology:
  - Information Technology
    - Artificial Intelligence
      - Machine Learning > Neural Networks (0.45)
      - Natural Language > Large Language Model (0.46)
      - Representation & Reasoning (1.00)
      - Vision (1.00)
    - Information Management > Search (0.87)