Automatic Scene Generation: State-of-the-Art Techniques, Models, Datasets, Challenges, and Future Prospects
Fime, Awal Ahmed, Mahmud, Saifuddin, Das, Arpita, Islam, Md. Sunzidul, Kim, Hong-Hoon
–arXiv.org Artificial Intelligence
Automatic scene generation is an essential area of research with applications in robotics, recreation, visual representation, training and simulation, education, and more. This survey provides a comprehensive review of the current state-of-the-arts in automatic scene generation, focusing on techniques that leverage machine learning, deep learning, embedded systems, and natural language processing (NLP). We categorize the models into four main types: Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), Transformers, and Diffusion Models. Each category is explored in detail, discussing various sub-models and their contributions to the field. We also review the most commonly used datasets, such as COCO-Stuff, Visual Genome, and MS-COCO, which are critical for training and evaluating these models. Methodologies for scene generation are examined, including image-to-3D conversion, text-to-3D generation, UI/layout design, graph-based methods, and interactive scene generation. Evaluation metrics such as Frechet Inception Distance (FID), Kullback-Leibler (KL) Divergence, Inception Score (IS), Intersection over Union (IoU), and Mean Average Precision (mAP) are discussed in the context of their use in assessing model performance. The survey identifies key challenges and limitations in the field, such as maintaining realism, handling complex scenes with multiple objects, and ensuring consistency in object relationships and spatial arrangements. By summarizing recent advances and pinpointing areas for improvement, this survey aims to provide a valuable resource for researchers and practitioners working on automatic scene generation.
arXiv.org Artificial Intelligence
Sep-14-2024
- Country:
- North America > United States
- New York (0.04)
- Ohio > Portage County
- Kent (0.04)
- Illinois > Peoria County
- Peoria (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Europe
- Greece (0.04)
- United Kingdom > Wales
- Cardiff (0.04)
- Switzerland > Zürich
- Zürich (0.13)
- Romania > Sud - Muntenia Development Region
- Giurgiu County > Giurgiu (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Germany > Baden-Württemberg
- Karlsruhe Region > Karlsruhe (0.04)
- Asia
- Bangladesh (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Japan > Honshū
- Chūbu
- Nagano Prefecture > Nagano (0.04)
- Ishikawa Prefecture > Kanazawa (0.04)
- Chūbu
- China > Beijing
- Beijing (0.04)
- North America > United States
- Genre:
- Research Report > Promising Solution (1.00)
- Overview (1.00)
- Industry:
- Media (1.00)
- Education (1.00)
- Health & Medicine (0.67)
- Information Technology > Security & Privacy (0.67)
- Leisure & Entertainment > Games
- Computer Games (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Vision > Image Understanding (1.00)
- Representation & Reasoning > Object-Oriented Architecture (1.00)
- Natural Language
- Text Processing (1.00)
- Large Language Model (1.00)
- Machine Learning
- Unsupervised or Indirectly Supervised Learning (1.00)
- Statistical Learning (1.00)
- Performance Analysis (1.00)
- Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence