Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction Y aodong Y u

Neural Information Processing Systems 

The coding rate can be accurately computed from finite samples of degenerate subspace-like distributions and can learn intrinsic representations in supervised, self-supervised, and unsupervised settings in a unified manner.