InstCache: A Predictive Cache for LLM Serving

Open in new window