Reparameterized Variational Divergence Minimization for Stable Imitation