Optimal Sparse Linear Encoders and Sparse PCA

Neural Information Processing Systems 

Principal components analysis~(PCA) is the optimal linear encoder of data. Sparse linear encoders (e.g., sparse PCA) produce more interpretable features that can promote better generalization.