Expressive Gaussian Human Avatars from Monocular RGB Video

May-28-2025, 09:52:45 GMT–Neural Information Processing Systems

Nuanced expressiveness, especially through detailed hand and facial expressions, is pivotal for enhancing the realism and vitality of digital human representations. In this work, we aim to learn expressive human avatars from a monocular RGB video, a setting that introduces new challenges in capturing and animating fine-grained details. To this end, we introduce EVA, a drivable human model that can recover fine details based on 3D Gaussians and an expressive parametric human model, SMPL-X. Focused on enhancing expressiveness, our work makes three key contributions. First, we highlight the importance of aligning the SMPL-X model with the video frames for effective avatar learning.

artificial intelligence, avatar, machine learning

Country:
- North America > United States > Texas (0.14)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.93)
  - Vision (1.00)