Challenging GPU Dominance: When CPUs Outperform for On-Device LLM Inference

Open in new window