Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models

Passoni, Riccardo, Ronchini, Francesca, Comanducci, Luca, Serizel, Romain, Antonacci, Fabio

Jul-17-2025–arXiv.org Artificial Intelligence

Text-to-audio models have recently emerged as a powerful technology for generating sound from textual descriptions. However, their high computational demands raise concerns about energy consumption and environmental impact. In this paper, we conduct an analysis of the energy usage of 7 state-of-the-art text-to-audio diffusion-based generative models, evaluating to what extent variations in generation parameters affect energy consumption at inference time. We also aim to identify an optimal balance between audio quality and energy consumption by considering Pareto-optimal solutions across all selected models. Our findings provide insights into the trade-offs between performance and environmental impact, contributing to the development of more efficient generative audio models.

energy consumption, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Jul-17-2025

arXiv.org PDF

Add feedback

Country:
- Europe (0.46)
- North America > United States (0.28)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Energy (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Representation & Reasoning > Optimization (0.92)
  - Machine Learning > Neural Networks
    - Deep Learning (0.71)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found