GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces

Yu, Hao, Mallick, Rupayan, Betke, Margrit, Bargal, Sarah Adel

Apr-11-2025–arXiv.org Artificial Intelligence

-- Cartoon avatars have been widely used in various applications, including social media, online tutoring, and gaming. However, existing cartoon avatar datasets and generation methods struggle to present highly expressive avatars with fine-grained facial expressions and are often inspired from real-world identities, raising privacy concerns. T o address these challenges, we propose a novel framework, GenEA va, for generating high-quality cartoon avatars with fine-grained facial expressions. Our approach fine-tunes a state-of-the-art text-to-image diffusion model to synthesize highly detailed and expressive facial expressions. We then incorporate a stylization model that transforms these realistic faces into cartoon avatars while preserving both identity and expression. Leveraging this framework, we introduce the first expressive cartoon avatar dataset, GenEA va 1.0, specifically designed to capture 135 fine-grained facial expressions, featuring 13,230 expressive cartoon avatars with a balanced distribution across genders, racial groups, and age ranges. We demonstrate that our fine-tuned model generates more expressive faces than the state-of-the-art text-to-image diffusion model SDXL. We also verify that the cartoon avatars generated by our framework do not include memorized identities from fine-tuning data. The proposed framework and dataset provide a diverse and expressive benchmark for future research in cartoon avatar generation. I. INTRODUCTION Cartoon avatars have become increasingly important in various digital domains, serving as personalized digital representations in applications such as social media [41], chatbots [22], online tutoring [18], [57], video conferencing [45], virtual reality [5], and video games [40]. As digital communication evolves, cartoon avatars offer a compelling alternative to realistic human representations, providing users with enhanced personalization and privacy, and enriching user engagement and interaction across various platforms.

artificial intelligence, expression, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Apr-11-2025

arXiv.org PDF

Add feedback

Country:
- Europe (0.46)
- North America > United States (0.28)
- Asia (0.28)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Leisure & Entertainment (0.68)
- Information Technology (0.48)
- Media (0.48)
- Education > Educational Technology
  - Educational Software > Computer Based Training (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Vision > Face Recognition (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found