SceneCraft: Layout-Guided 3D Scene Generation
–Neural Information Processing Systems
The creation of complex 3D scenes tailored to user specifications has been a tedious and challenging task with traditional 3D modeling tools. Although some pioneering methods have achieved automatic text-to-3D generation, they are generally limited to small-scale scenes with restricted control over the shape and texture. We introduce SceneCraft, a novel method for generating detailed indoor scenes that adhere to textual descriptions and spatial layout preferences provided by users. Central to our method is a rendering-based technique, which converts 3D semantic layouts into multi-view 2D proxy maps. Furthermore, we design a semantic and depth conditioned diffusion model to generate multi-view images, which are used to learn a neural radiance field (NeRF) as the final scene representation.
Neural Information Processing Systems
May-27-2025, 09:31:16 GMT
- Genre:
- Research Report > Promising Solution (0.63)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (0.62)
- Vision (0.82)
- Information Technology > Artificial Intelligence