Benchmarking Energy Efficiency of Large Language Models Using vLLM

Sep-12-2025, arXiv.org Artificial Intelligence

Abstract--The prevalence of Large Language Models (LLMs) is having a growing impact on the climate due to the substantial energy required for their deployment and use. To raise awareness among developers who are implementing LLMs in their products, there is a strong need to collect more information about the energy efficiency of LLMs. While existing research has evaluated the energy efficiency of various models, these benchmarks often fall short of representing realistic production scenarios. In this paper, we introduce the LLM Efficiency Benchmark, designed to simulate real-world usage conditions. We examine how factors such as model size, architecture, and concurrent request volume affect inference energy efficiency. Our findings demonstrate that it is possible to create energy efficiency benchmarks that better reflect practical deployment conditions, providing valuable insights for developers aiming to build more sustainable AI systems.

Large Language Models (LLMs) have seen a significant rise in popularity in recent years. They are increasingly integrated into everyday applications, such as Google's AI-generated summaries for search results, OpenAI's GPT-4o, and the growing adoption of AI agents across various platforms.

large language model, machine learning, natural language

Country:
- Europe > Netherlands (0.14)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Energy (1.00)
- Information Technology > Security & Privacy (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
