MPCache: MPC-Friendly KV Cache Eviction for Efficient Private LLM Inference

Open in new window