Scaling LLM Test-Time Compute with Mobile NPU on Smartphones