Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression

Open in new window