face shape
IdentiFace : A VGG Based Multimodal Facial Biometric System
Rabea, Mahmoud, Ahmed, Hanya, Mahmoud, Sohaila, Sayed, Nourhan
The development of facial biometric systems has contributed greatly to the development of the computer vision field. Nowadays, there's always a need to develop a multimodal system that combines multiple biometric traits in an efficient, meaningful way. In this paper, we introduce "IdentiFace" which is a multimodal facial biometric system that combines the core of facial recognition with some of the most important soft biometric traits such as gender, face shape, and emotion. We also focused on developing the system using only VGG-16 inspired architecture with minor changes across different subsystems. This unification allows for simpler integration across modalities. It makes it easier to interpret the learned features between the tasks which gives a good indication about the decision-making process across the facial modalities and potential connection. For the recognition problem, we acquired a 99.2% test accuracy for five classes with high intra-class variations using data collected from the FERET database[1]. We achieved 99.4% on our dataset and 95.15% on the public dataset[2] in the gender recognition problem. We were also able to achieve a testing accuracy of 88.03% in the face-shape problem using the celebrity face-shape dataset[3]. Finally, we achieved a decent testing accuracy of 66.13% in the emotion task which is considered a very acceptable accuracy compared to related work on the FER2013 dataset[4].
Revealed: The 6 key signs that a video is a deepfake - and why you should always pay close attention to the EARS
For years, children have been told not to believe everything they see online - but we may need to now extend this lesson to adults. That's because we are in the midst of a'deepfake' surge, where artificial intelligence (AI) is being used to manipulate videos and audio in a way that replicates real life. From an ultra-realistic video of Margot Robbie scrubbing the floor to an unsettling video of Vladimir Putin, several uncanny deepfake videos have hit the headlines in recent years. But how can you tell that all is not as it seems with these videos? Experts have revealed the six key signs that indicate a video is a deepfake - and say you should always pay close attention to the ears.
MFIM: Megapixel Facial Identity Manipulation
Face swapping is a task that changes a facial identity of a given image to that of another person. In this work, we propose a novel face-swapping framework called Megapixel Facial Identity Manipulation (MFIM). The face-swapping model should achieve two goals. First, it should be able to generate a high-quality image. We argue that a model which is proficient in generating a megapixel image can achieve this goal. However, generating a megapixel image is generally difficult without careful model design. Therefore, our model exploits pretrained StyleGAN in the manner of GAN-inversion to effectively generate a megapixel image. Second, it should be able to effectively transform the identity of a given image. Specifically, it should be able to actively transform ID attributes (e.g., face shape and eyes) of a given image into those of another person, while preserving ID-irrelevant attributes (e.g., pose and expression). To achieve this goal, we exploit 3DMM that can capture various facial attributes. Specifically, we explicitly supervise our model to generate a face-swapped image with the desirable attributes using 3DMM. We show that our model achieves state-of-the-art performance through extensive experiments. Furthermore, we propose a new operation called ID mixing, which creates a new identity by semantically mixing the identities of several people. It allows the user to customize the new identity.
Hate your nose? Blame your ancient cousins! Neanderthal DNA dictates the shape, study finds
It's something that many people are self-conscious of, and if you not a fan of your nose, we finally know who to blame. Scientists have revealed that Neanderthal DNA helps dictate the shape of your nose. A new study led by UCL researchers found that a particular gene, which leads to a taller nose, may have been the product of natural selection as ancient humans adapted to colder climates after leaving Africa. Dr Kaustubh Adhikari, who led the study, said: 'In the last 15 years, since the Neanderthal genome has been sequenced, we have been able to learn that our own ancestors apparently interbred with Neanderthals, leaving us with little bits of their DNA. 'Here, we find that some DNA inherited from Neanderthals influences the shape of our faces.
FlowFace: Semantic Flow-guided Shape-aware Face Swapping
Zeng, Hao, Zhang, Wei, Fan, Changjie, Lv, Tangjie, Wang, Suzhen, Zhang, Zhimeng, Ma, Bowen, Li, Lincheng, Ding, Yu, Yu, Xin
In this work, we propose a semantic flow-guided two-stage framework for shape-aware face swapping, namely FlowFace. Unlike most previous methods that focus on transferring the source inner facial features but neglect facial contours, our FlowFace can transfer both of them to a target face, thus leading to more realistic face swapping. Concretely, our FlowFace consists of a face reshaping network and a face swapping network. The face reshaping network addresses the shape outline differences between the source and target faces. It first estimates a semantic flow (i.e., face shape differences) between the source and the target face, and then explicitly warps the target face shape with the estimated semantic flow. After reshaping, the face swapping network generates inner facial features that exhibit the identity of the source face. We employ a pre-trained face masked autoencoder (MAE) to extract facial features from both the source face and the target face. In contrast to previous methods that use identity embedding to preserve identity information, the features extracted by our encoder can better capture facial appearances and identity information. Then, we develop a cross-attention fusion module to adaptively fuse inner facial features from the source face with the target facial attributes, thus leading to better identity preservation. Extensive quantitative and qualitative experiments on in-the-wild faces demonstrate that our FlowFace outperforms the state-of-the-art significantly.
Deep-MDS Framework for Recovering the 3D Shape of 2D Landmarks from a Single Image
Kamyab, Shima, Azimifar, Zohreh
In this paper, a low parameter deep learning framework utilizing the Non-metric Multi-Dimensional scaling (NMDS) method, is proposed to recover the 3D shape of 2D landmarks on a human face, in a single input image. Hence, NMDS approach is used for the first time to establish a mapping from a 2D landmark space to the corresponding 3D shape space. A deep neural network learns the pairwise dissimilarity among 2D landmarks, used by NMDS approach, whose objective is to learn the pairwise 3D Euclidean distance of the corresponding 2D landmarks on the input image. This scheme results in a symmetric dissimilarity matrix, with the rank larger than 2, leading the NMDS approach toward appropriately recovering the 3D shape of corresponding 2D landmarks. In the case of posed images and complex image formation processes like perspective projection which causes occlusion in the input image, we consider an autoencoder component in the proposed framework, as an occlusion removal part, which turns different input views of the human face into a profile view. The results of a performance evaluation using different synthetic and real-world human face datasets, including Besel Face Model (BFM), CelebA, CoMA - FLAME, and CASIA-3D, indicates the comparable performance of the proposed framework, despite its small number of training parameters, with the related state-of-the-art and powerful 3D reconstruction methods from the literature, in terms of efficiency and accuracy.
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation
Ling, Jun, Tan, Xu, Chen, Liyang, Li, Runnan, Zhang, Yuchao, Zhao, Sheng, Song, Li
While previous speech-driven talking face generation methods have made significant progress in improving the visual quality and lip-sync quality of the synthesized videos, they pay less attention to lip motion jitters which greatly undermine the realness of talking face videos. What causes motion jitters, and how to mitigate the problem? In this paper, we conduct systematic analyses on the motion jittering problem based on a state-of-the-art pipeline that uses 3D face representations to bridge the input audio and output video, and improve the motion stability with a series of effective designs. We find that several issues can lead to jitters in synthesized talking face video: 1) jitters from the input 3D face representations; 2) training-inference mismatch; 3) lack of dependency modeling among video frames. Accordingly, we propose three effective solutions to address this issue: 1) we propose a gaussian-based adaptive smoothing module to smooth the 3D face representations to eliminate jitters in the input; 2) we add augmented erosions on the input data of the neural renderer in training to simulate the distortion in inference to reduce mismatch; 3) we develop an audio-fused transformer generator to model dependency among video frames. Besides, considering there is no off-the-shelf metric for measuring motion jitters in talking face video, we devise an objective metric (Motion Stability Index, MSI), to quantitatively measure the motion jitters by calculating the reciprocal of variance acceleration. Extensive experimental results show the superiority of our method on motion-stable face video generation, with better quality than previous systems.
#011 Machine Learning - Decision three - Master Data Science 15.08.2022
In today's post, we are going to talk about one of the learning algorithms that are very powerful and is used in many machine learning applications. It is called decision trees and tree ensembles. It is a very powerful tool that is well worth having in your toolbox. In this post, we'll learn about decision trees and we'll see how you can apply them in your own machine learning projects. So, let's begin with our post. To explain how decision trees work, we are going to use the following cat classification example. Let's imagine that you are running a cat adoption center and given a few features, you want to train a classifier to quickly tell you if an animal is a cat or not.
People with square faces are seen as more AGGRESSIVE than those with oval faces, study finds
From Zac Efron to Margot Robbie, many of the world's most beautiful celebrities are known for their square faces. Now, a new study claims that people with this face shape are seen as more aggressive than those with oval faces, such as Rihanna and Ben Affleck. Researchers from the University of New South Wales measured the facial-width-to-height ratio (FWHR) of 17,607 passport images of male and female faces, before asking people to rate them for aggression. The results revealed that faces with a high FWHR (square faces) were rated as more aggressive than people with low FWHR (oval faces) – particularly if they belonged to young men. From Zac Efron to Margot Robbie, many of the world's most beautiful celebrities are known for their square faces Researchers from the University of New South Wales measured the facial-width-to-height ratio (FWHR) of 17,607 passport images of male and female faces, before asking people to rate them for aggression.
CelebHair: A New Large-Scale Dataset for Hairstyle Recommendation based on CelebA
Chen, Yutao, Zhang, Yuxuan, Huang, Zhongrui, Luo, Zhenyao, Chen, Jinpeng
In this paper, we present a new large-scale dataset for hairstyle recommendation, CelebHair, based on the celebrity facial attributes dataset, CelebA. Our dataset inherited the majority of facial images along with some beauty-related facial attributes from CelebA. Additionally, we employed facial landmark detection techniques to extract extra features such as nose length and pupillary distance, and deep convolutional neural networks for face shape and hairstyle classification. Empirical comparison has demonstrated the superiority of our dataset to other existing hairstyle-related datasets regarding variety, veracity, and volume. Analysis and experiments have been conducted on the dataset in order to evaluate its robustness and usability.