On the Necessity and Effectiveness of Learning the Prior of Variational Auto-Encoder