Face Reconstruction from Voice using Generative Adversarial Networks

Oct-11-2024, 05:02:02 GMT–Neural Information Processing Systems

Voice profiling aims at inferring various human parameters from their speech, e.g. In this paper, we address the challenge posed by a subtask of voice profiling - reconstructing someone's face from their voice. The task is designed to answer the question: given an audio clip spoken by an unseen person, can we picture a face that has as many common elements, or associations as possible with the speaker, in terms of identity? To address this problem, we propose a simple but effective computational framework based on generative adversarial networks (GANs). The network learns to generate faces from voices by matching the identities of generated faces to those of the speakers, on a training set.

face reconstruction, generate face, generative adversarial network

Neural Information Processing Systems

Oct-11-2024, 05:02:02 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Unsupervised or Indirectly Supervised Learning (0.65)
  - Neural Networks (0.65)