IDOL: Instant Photorealistic 3D Human Creation from a Single Image
Zhuang, Yiyu, Lv, Jiaxi, Wen, Hao, Shuai, Qing, Zeng, Ailing, Zhu, Hao, Chen, Shifeng, Yang, Yujiu, Cao, Xun, Liu, Wei
–arXiv.org Artificial Intelligence
Creating a high-fidelity, animatable 3D full-body avatar from a single image is a challenging task due to the diverse appearance and poses of humans and the limited availability of high-quality training data. To achieve fast and high-quality human reconstruction, this work rethinks the task from the perspectives of dataset, model, and representation. First, we introduce a large-scale HUman-centric GEnerated dataset, HuGe100K, consisting of 100K diverse, photorealistic sets of human images. Each set contains 24-view frames in specific human poses, generated using a pose-controllable image-to-multi-view model. Next, leveraging the diversity in views, poses, and appearances within HuGe100K, we develop a scalable feed-forward transformer model to predict a 3D human Gaussian representation in a uniform space from a given human image. This model is trained to disentangle human pose, body shape, clothing geometry, and texture. The estimated Gaussians can be animated without post-processing. We conduct comprehensive experiments to validate the effectiveness of the proposed dataset and method. Our model demonstrates the ability to efficiently reconstruct photorealistic humans at 1K resolution from a single input image using a single GPU instantly. Additionally, it seamlessly supports various applications, as well as shape and texture editing tasks.
arXiv.org Artificial Intelligence
Dec-19-2024
- Country:
- South America
- Oceania
- Solomon Islands (0.04)
- Papua New Guinea (0.04)
- New Zealand (0.04)
- Fiji (0.04)
- Australia (0.04)
- North America
- Canada (0.04)
- Trinidad and Tobago (0.04)
- Panama (0.04)
- Costa Rica (0.04)
- Puerto Rico (0.04)
- Dominican Republic (0.04)
- Honduras (0.04)
- Belize (0.04)
- Nicaragua (0.04)
- Guatemala (0.04)
- Mexico (0.04)
- Jamaica (0.04)
- United States (0.04)
- El Salvador (0.04)
- Haiti (0.04)
- Cuba (0.04)
- Europe
- Poland (0.04)
- Germany (0.04)
- Spain (0.04)
- Netherlands (0.04)
- Switzerland (0.04)
- France (0.04)
- Italy (0.04)
- Belgium (0.04)
- Sweden (0.04)
- Asia
- India (0.04)
- Tajikistan (0.04)
- Singapore (0.04)
- Thailand (0.04)
- Turkmenistan (0.04)
- Sri Lanka (0.04)
- Kyrgyzstan (0.04)
- Myanmar (0.04)
- South Korea (0.04)
- Philippines (0.04)
- Laos (0.04)
- Indonesia (0.04)
- Vietnam (0.04)
- Bhutan (0.04)
- Bangladesh (0.04)
- Uzbekistan (0.04)
- Kazakhstan (0.04)
- North Korea (0.04)
- Malaysia (0.04)
- Mongolia (0.04)
- Nepal (0.04)
- Cambodia (0.04)
- Brunei (0.04)
- Pakistan (0.04)
- China
- Guangdong Province > Shenzhen (0.04)
- Jiangsu Province > Nanjing (0.04)
- Middle East
- Israel (0.04)
- Saudi Arabia (0.04)
- Lebanon (0.04)
- Jordan (0.04)
- Kuwait (0.04)
- Iran (0.04)
- UAE (0.04)
- Republic of Türkiye (0.04)
- Oman (0.04)
- Qatar (0.04)
- Japan > Honshū
- Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- Africa
- Genre:
- Research Report (0.64)
- Technology:
- Information Technology
- Sensing and Signal Processing > Image Processing (1.00)
- Artificial Intelligence
- Vision (1.00)
- Machine Learning > Neural Networks (1.00)
- Natural Language > Large Language Model (0.68)
- Robots > Humanoid Robots (0.64)
- Information Technology