Joint Learning of Depth and Appearance for Portrait Image Animation