Reviews: Face Reconstruction from Voice using Generative Adversarial Networks

Neural Information Processing Systems 

This paper proposes a convolutional neural network based model to reconstruct a face from spoken speech. The training is done by using supervised GAN. The problem is novel, but the model itself is not so much as using encoder (or embedder) and decoder (or generator) is quite standard, and supervised GAN training has also been popularly used, so in that perspective, its novelty is incremental. But I think this paper needs more thorough experimental study to show the effectiveness of the proposed model: 1. From the experimental results, I suspect that the generated faces only match those attributes (gender, race, etc.) but not much about identities.