UniFLG: Unified Facial Landmark Generator from Text or Speech

Kentaro Mitsui, Yukiya Hono, Kei Sawada

May-18-2023–arXiv.org Artificial Intelligence

Talking face generation has been extensively investigated owing to its wide applicability. The two primary frameworks for talking face generation are the text-driven framework, which generates synchronized speech and talking faces from text, and the speech-driven framework, which generates talking faces from speech. To integrate these frameworks, this paper proposes a unified facial landmark generator (UniFLG). The proposed system exploits end-to-end text-to-speech not only to synthesize speech but also to extract a series of latent representations common to text and speech, which it feeds to a landmark decoder to generate facial landmarks. We demonstrate that our system achieves higher naturalness in both speech synthesis and facial landmark generation than the state-of-the-art text-driven method. We further demonstrate that our system can generate facial landmarks from the speech of speakers without facial video data or even speech data.
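The data flow described in the abstract (two input modalities mapped into one shared latent sequence, then a single landmark decoder) can be illustrated with a minimal sketch. This is not the authors' implementation: the linear layers, all dimensions, the 68-point landmark count, the fixed phoneme duration, and every function name below are assumptions chosen for illustration only, and the random weights stand in for parameters that a real system would learn.

```python
import numpy as np

# All sizes below are illustrative assumptions, not values from the paper.
LATENT_DIM = 16      # size of the shared latent space
N_LANDMARKS = 68     # a common landmark convention, assumed here
VOCAB_SIZE = 40      # hypothetical phoneme inventory size
N_MELS = 80          # hypothetical number of mel-spectrogram bins

rng = np.random.default_rng(0)

# Hypothetical parameters; random values stand in for learned weights.
text_embedding = rng.normal(size=(VOCAB_SIZE, LATENT_DIM))
speech_projection = rng.normal(size=(N_MELS, LATENT_DIM))
decoder_weights = rng.normal(size=(LATENT_DIM, N_LANDMARKS * 2))


def encode_text(phoneme_ids, frames_per_phoneme=5):
    """Map phoneme ids to frame-level latents by repeating each embedding."""
    latents = text_embedding[np.asarray(phoneme_ids)]
    return np.repeat(latents, frames_per_phoneme, axis=0)


def encode_speech(mel_frames):
    """Map mel-spectrogram frames (T, N_MELS) into the same latent space."""
    return np.asarray(mel_frames) @ speech_projection


def decode_landmarks(latents):
    """Decode latents (T, LATENT_DIM) into landmarks (T, N_LANDMARKS, 2)."""
    flat = latents @ decoder_weights
    return flat.reshape(len(latents), N_LANDMARKS, 2)


def generate_landmarks(text=None, speech=None):
    """Route either modality through its encoder, then the shared decoder."""
    if (text is None) == (speech is None):
        raise ValueError("provide exactly one of text or speech")
    latents = encode_text(text) if text is not None else encode_speech(speech)
    return decode_landmarks(latents)
```

Calling `generate_landmarks(text=[1, 2, 3])` or `generate_landmarks(speech=mel)` with a `(T, 80)` array returns one set of 2-D landmark coordinates per frame. The point of the sketch is only that both inputs reach the decoder through the same latent interface, so the decoder does not need to know which modality produced them.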

artificial intelligence, machine learning, natural language

Country:
- North America
  - United States
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - California > Santa Clara County
      - Sunnyvale (0.04)
  - Canada
    - Quebec > Montreal (0.04)
    - Alberta
      - Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.04)
      - Census Division No. 15 > Improvement District No. 9 > Banff (0.04)
- Europe
  - United Kingdom > England
    - Surrey > Guildford (0.04)
    - East Sussex > Brighton (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)
  - Austria > Styria
    - Graz (0.04)
- Asia
  - Singapore (0.04)
  - South Korea > Incheon
    - Incheon (0.04)
  - Japan > Honshū
    - Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
  - India > Telangana
    - Hyderabad (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (0.68)
  - Machine Learning > Neural Networks (0.68)
  - Speech > Speech Synthesis (0.57)
