Generalized Cauchy-Schwarz Divergence and Its Deep Learning Applications