Can LLMs Maintain Fundamental Abilities under KV Cache Compression?
