Low-rank Momentum Factorization for Memory Efficient Training
