Understanding Deep Representation Learning via Layerwise Feature Compression and Discrimination