Uncovering Regional Defaults from Photorealistic Forests in Text-to-Image Generation with DALL-E 2
Liu, Zilong, Janowicz, Krzysztof, Currier, Kitty, Shi, Meilin
–arXiv.org Artificial Intelligence
Regional defaults describe the emerging phenomenon that text-to-image (T2I) foundation models used in generative AI are prone to over-proportionally depicting certain geographic regions to the exclusion of others. In this work, we introduce a scalable evaluation for uncovering such regional defaults. The evaluation consists of region hierarchy--based image generation and cross-level similarity comparisons. We carry out an experiment by prompting DALL-E 2, a state-of-the-art T2I generation model capable of generating photorealistic images, to depict a forest. We select forest as an object class that displays regional variation and can be characterized using spatial statistics. For a region in the hierarchy, our experiment reveals the regional defaults implicit in DALL-E 2, along with their scale-dependent nature and spatial relationships. In addition, we discover that the implicit defaults do not necessarily correspond to the most widely forested regions in reality. Our findings underscore a need for further investigation into the geography of T2I generation and other forms of generative AI.
arXiv.org Artificial Intelligence
Oct-3-2024
- Country:
- Africa (1.00)
- Europe > Austria (0.29)
- North America > United States (0.28)
- Genre:
- Research Report > New Finding (0.87)
- Technology: