Experts are all you need: A Composable Framework for Large Language Model Inference

Open in new window