GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces
Yu, Hao, Mallick, Rupayan, Betke, Margrit, Bargal, Sarah Adel
–arXiv.org Artificial Intelligence
-- Cartoon avatars have been widely used in various applications, including social media, online tutoring, and gaming. However, existing cartoon avatar datasets and generation methods struggle to present highly expressive avatars with fine-grained facial expressions and are often inspired from real-world identities, raising privacy concerns. T o address these challenges, we propose a novel framework, GenEA va, for generating high-quality cartoon avatars with fine-grained facial expressions. Our approach fine-tunes a state-of-the-art text-to-image diffusion model to synthesize highly detailed and expressive facial expressions. We then incorporate a stylization model that transforms these realistic faces into cartoon avatars while preserving both identity and expression. Leveraging this framework, we introduce the first expressive cartoon avatar dataset, GenEA va 1.0, specifically designed to capture 135 fine-grained facial expressions, featuring 13,230 expressive cartoon avatars with a balanced distribution across genders, racial groups, and age ranges. We demonstrate that our fine-tuned model generates more expressive faces than the state-of-the-art text-to-image diffusion model SDXL. We also verify that the cartoon avatars generated by our framework do not include memorized identities from fine-tuning data. The proposed framework and dataset provide a diverse and expressive benchmark for future research in cartoon avatar generation. I. INTRODUCTION Cartoon avatars have become increasingly important in various digital domains, serving as personalized digital representations in applications such as social media [41], chatbots [22], online tutoring [18], [57], video conferencing [45], virtual reality [5], and video games [40]. As digital communication evolves, cartoon avatars offer a compelling alternative to realistic human representations, providing users with enhanced personalization and privacy, and enriching user engagement and interaction across various platforms.
arXiv.org Artificial Intelligence
Apr-11-2025
- Country:
- Europe (0.46)
- North America > United States (0.28)
- Asia (0.28)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Leisure & Entertainment (0.68)
- Information Technology (0.48)
- Media (0.48)
- Education > Educational Technology
- Technology: