Granular Privacy Control for Geolocation with Vision Language Models
Mendes, Ethan, Chen, Yang, Hays, James, Das, Sauvik, Xu, Wei, Ritter, Alan
–arXiv.org Artificial Intelligence
Vision Language Models (VLMs) are rapidly advancing in their capability to answer information-seeking questions. As these models are widely deployed in consumer applications, they could lead to new privacy risks due to emergent abilities to identify people in photos, geolocate images, etc. As we demonstrate, somewhat surprisingly, current open-source and proprietary VLMs are very capable image geolocators, making widespread geolocation with VLMs an immediate privacy risk, rather than merely a theoretical future concern. As a first step to address this challenge, we develop a new benchmark, GPTGeoChat, to test the ability of VLMs to moderate geolocation dialogues with users. We collect a set of 1,000 image geolocation conversations between in-house annotators and GPT-4v, which are annotated with the granularity of location information revealed at each turn. Using this new dataset, we evaluate the ability of various VLMs to moderate GPT-4v geolocation conversations by determining when too much location information has been revealed. We find that custom fine-tuned models perform on par with prompted API-based models when identifying leaked location information at the country or city level; however, fine-tuning on supervised data appears to be needed to accurately moderate finer granularities, such as the name of a restaurant or building.
arXiv.org Artificial Intelligence
Jul-6-2024
- Country:
- Indian Ocean > Arabian Gulf (0.04)
- South America
- Pacific Ocean > North Pacific Ocean
- San Francisco Bay (0.04)
- Oceania > Australia
- Western Australia > Perth (0.04)
- North America
- Central America (0.04)
- United States
- New York (0.14)
- Minnesota (0.05)
- Wisconsin (0.05)
- Michigan (0.04)
- Indiana (0.04)
- Nevada (0.04)
- Ohio (0.04)
- Maine (0.04)
- Utah (0.04)
- Missouri (0.04)
- South Dakota (0.04)
- New Hampshire (0.04)
- North Dakota (0.04)
- Vermont (0.04)
- Colorado (0.04)
- Massachusetts (0.04)
- Montana (0.04)
- Texas
- Denton County (0.04)
- Dallas County > Irving (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New Jersey > Mercer County
- Trenton (0.04)
- Illinois > Kane County
- Geneva (0.14)
- California
- San Francisco County > San Francisco (0.04)
- Santa Clara County > Santa Clara (0.04)
- Orange County > Orange (0.04)
- Los Angeles County (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- Costa Rica > San José Province
- San José (0.04)
- Canada > British Columbia
- Europe
- Italy (0.14)
- Austria (0.04)
- Iceland (0.04)
- Switzerland (0.04)
- Finland (0.04)
- Norway (0.04)
- Russia (0.04)
- Latvia (0.04)
- Lithuania (0.04)
- Estonia (0.04)
- Sweden (0.04)
- Ireland (0.04)
- Middle East (0.04)
- Eastern Europe (0.04)
- United Kingdom > Scotland (0.04)
- Hungary > Budapest
- Budapest (0.04)
- Germany
- Bavaria (0.04)
- Baden-Württemberg (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- France > Île-de-France
- Asia
- China (0.14)
- Russia (0.04)
- Mongolia (0.04)
- South Korea (0.04)
- Japan > Hokkaidō (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Middle East
- Saudi Arabia > Arabian Gulf (0.04)
- Bahrain > Capital Governorate
- Manama (0.05)
- Africa
- West Africa (0.04)
- Middle East (0.04)
- Ghana > Greater Accra
- Accra (0.04)
- Genre:
- Research Report (0.50)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Government (1.00)
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.93)
- Leisure & Entertainment (0.92)
- Technology:
- Information Technology
- Communications > Social Media (1.00)
- Artificial Intelligence
- Vision (1.00)
- Natural Language
- Large Language Model (1.00)
- Chatbot (1.00)
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Information Technology