Deep CNNs Meet Global Covariance Pooling: Better Representation and Generalization