Fast and Memory Optimal Low-Rank Matrix Approximation Se-Young Yun Marc Lelarge

Neural Information Processing Systems 

Our algorithm makes one pass on the data if the columns of M are revealed in a random order, and two passes if the columns of M arrive in an arbitrary order. To reduce its memory footprint and complexity, SLA uses random sparsification, and samples each entry of M with a small probability δ. In turn, SLA is memory optimal as its required memory space scales as k(m+n), the dimension of its output.