Uncovering Regional Defaults from Photorealistic Forests in Text-to-Image Generation with DALL-E 2
Liu, Zilong, Janowicz, Krzysztof, Currier, Kitty, Shi, Meilin
–arXiv.org Artificial Intelligence
Regional defaults describe the emerging phenomenon that text-to-image (T2I) foundation models used in generative AI are prone to over-proportionally depicting certain geographic regions to the exclusion of others. In this work, we introduce a scalable evaluation for uncovering such regional defaults. The evaluation consists of region hierarchy--based image generation and cross-level similarity comparisons. We carry out an experiment by prompting DALL-E 2, a state-of-the-art T2I generation model capable of generating photorealistic images, to depict a forest. We select forest as an object class that displays regional variation and can be characterized using spatial statistics. For a region in the hierarchy, our experiment reveals the regional defaults implicit in DALL-E 2, along with their scale-dependent nature and spatial relationships. In addition, we discover that the implicit defaults do not necessarily correspond to the most widely forested regions in reality. Our findings underscore a need for further investigation into the geography of T2I generation and other forms of generative AI.
arXiv.org Artificial Intelligence
Oct-3-2024
- Country:
- Africa
- East Africa (0.04)
- Eritrea (0.04)
- North Africa (0.05)
- South Sudan (0.04)
- Southern Africa (0.04)
- Sudan (0.04)
- Uganda (0.05)
- Antarctica (0.05)
- Asia
- India (0.04)
- Russia (0.04)
- Southeast Asia (0.04)
- Europe
- Austria > Vienna (0.14)
- Eastern Europe (0.04)
- Russia (0.04)
- North America
- Central America (0.06)
- Sint Maarten (0.04)
- United States > California
- Santa Barbara County > Santa Barbara (0.04)
- Oceania > Nauru (0.05)
- South America (0.17)
- Africa
- Genre:
- Research Report > New Finding (0.87)
- Technology: