Bridging Maximum Likelihood and Adversarial Learning via $\alpha$-Divergence