Beyond Benchmarks: The Economics of AI Inference

Zhuang, Boqin, Qiao, Jiacheng, Liu, Mingqian, Yu, Mingxing, Hong, Ping, Li, Rui, Song, Xiaoxia, Xu, Xiangjun, Chen, Xu, Ma, Yaoyao, Gao, Yujie

Oct-31-2025–arXiv.org Artificial Intelligence

The inference cost of Large Language Models (LLMs) has become a critical factor in determining their commercial viability and widespread adoption. This paper introduces a quantitative ``economics of inference'' framework, treating the LLM inference process as a compute-driven intelligent production activity. We analyze its marginal cost, economies of scale, and quality of output under various performance configurations. Based on empirical data from WiNEval-3.0, we construct the first ``LLM Inference Production Frontier,'' revealing three principles: diminishing marginal cost, diminishing returns to scale, and an optimal cost-effectiveness zone. This paper not only provides an economic basis for model deployment decisions but also lays an empirical foundation for the future market-based pricing and optimization of AI inference resources.

artificial intelligence, large language model, natural language, (16 more...)

arXiv.org Artificial Intelligence

Oct-31-2025

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.40)

Industry:
- Energy (0.69)
- Health & Medicine (0.68)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found