ENOVA: Autoscaling towards Cost-effective and Stable Serverless LLM Serving

Open in new window