LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation

Open in new window