HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods

Li, Yongyuan; Qin, Xiuyuan; Liang, Chao; Wei, Mingqiang

Sep-14-2023–arXiv.org Artificial Intelligence

Talking Face Generation (TFG) aims to reconstruct facial movements, and in particular highly natural lip movements, from audio and facial features that are implicitly correlated. Existing TFG methods have made significant progress in producing natural and realistic images. However, most of this work pays little attention to visual quality: in cross-modal generation it is challenging to ensure lip synchronization while avoiding visual quality degradation. To address this issue, we propose a universal High-Definition Teeth Restoration Network, dubbed HDTR-Net, for arbitrary TFG methods. HDTR-Net can enhance teeth regions at an extremely fast speed while maintaining lip synchronization and temporal consistency. In particular, we propose a Fine-Grained Feature Fusion (FGFF) module that captures fine texture features in the teeth and surrounding regions and uses them to refine the feature map, enhancing the clarity of the teeth. Extensive experiments show that our method can be adapted to arbitrary TFG methods without degrading lip synchronization or frame coherence. Another advantage of HDTR-Net is its real-time generation ability: for high-definition restoration in talking face video synthesis, its inference speed is $300\%$ faster than the current state-of-the-art super-resolution-based face restoration method.




