PowerInfer-2: Fast Large Language Model Inference on a Smartphone
