Ripple: Accelerating LLM Inference on Smartphones with Correlation-Aware Neuron Management

Open in new window