A Performance Evaluation of a Quantized Large Language Model on Various Smartphones

Çöplü, Tolga, Loedi, Marc, Bendiken, Arto, Makohin, Mykhailo, Bouw, Joshua J., Cobb, Stephen

Dec-19-2023–arXiv.org Artificial Intelligence

This paper explores the feasibility and performance of on-device large language model (LLM) inference on various Apple iPhone models. Amidst the rapid evolution of generative AI, on-device LLMs offer solutions to privacy, security, and connectivity challenges inherent in cloud-based models. Leveraging existing literature on running multi-billion parameter LLMs on resource-limited devices, our study examines the thermal effects and interaction speeds of a high-performing LLM across different smartphone generations. We present real-world performance results, providing insights into on-device inference capabilities.

application, language model, llm, (16 more...)

arXiv.org Artificial Intelligence

Dec-19-2023

arXiv.org PDF

Add feedback

Genre:
- Research Report > Experimental Study (0.34)

Industry:
- Information Technology (0.49)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.68)