Expressive Gaussian Human Avatars from Monocular RGB Video
–Neural Information Processing Systems
Nuanced expressiveness, especially through detailed hand and facial expressions, is pivotal for enhancing the realism and vitality of digital human representations. In this work, we aim to learn expressive human avatars from a monocular RGB video; a setting that introduces new challenges in capturing and animating finegrained details. To this end, we introduce EVA, a drivable human model that can recover fine details based on 3D Gaussians and an expressive parametric human model, SMPL-X. Focused on enhancing expressiveness, our work makes three key contributions. First, we highlight the importance of aligning the SMPL-X model with the video frames for effective avatar learning.
Neural Information Processing Systems
May-28-2025, 09:52:45 GMT
- Country:
- North America > United States > Texas (0.14)
- Genre:
- Research Report (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (0.93)
- Vision (1.00)
- Information Technology > Artificial Intelligence