The Role of Mutual Information in Variational Classifiers