Metron: Holistic Performance Evaluation Framework for LLM Inference Systems

Open in new window