Goto

Collaborating Authors

 energy consumption


The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and Optimization

Neural Information Processing Systems

As the adoption of Generative AI in real-world services grow explosively, energy has emerged as a critical bottleneck resource. However, energy remains a metric that is often overlooked, under-explored, or poorly understood in the context of building ML systems. We present the ML.ENERGY Benchmark, a benchmark suite and tool for measuring inference energy consumption under realistic service environments, and the corresponding ML.ENERGY Leaderboard, which have served as a valuable resource for those hoping to understand and optimize the energy consumption of their generative AI services. In this paper, we explain four key design principles for benchmarking ML energy we have acquired over time, and then describe how they are implemented in the ML.ENERGY Benchmark. We then highlight results from the early 2025 iteration of the benchmark, including energy measurements of 40 widely used model architectures across 6 different tasks, case studies of how ML design choices impact energy consumption, and how automated optimization recommendations can lead to significant (sometimes more than 40%) energy savings without changing what is being computed by the model. The ML.ENERGY Benchmark is open-source and can be easily extended to various customized models and application scenarios.


S-MLP Spiking Neuron CONV BN

Neural Information Processing Systems

ThisEmbeddingfrequency-domain AEmbe Maximbalance, Empirically we, on ar Spiking gue, is the Transformers, root cause of adopting degraded A feature vg-Pooling representation (low-pass) in for SNNs to.


The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and Optimization

Neural Information Processing Systems

As the adoption of Generative AI in real-world services grow explosively, energy has emerged as a critical bottleneck resource. However, energy remains a metric that is often overlooked, under-explored, or poorly understood in the context of building ML systems. We present the ML.ENERGY Benchmark, a benchmark suite and tool for measuring inference energy consumption under realistic service environments, and the corresponding ML.ENERGY Leaderboard, which have served as a valuable resource for those hoping to understand and optimize the energy consumption of their generative AI services. In this paper, we explain four key design principles for benchmarking ML energy we have acquired over time, and then describe how they are implemented in the ML.ENERGY Benchmark. We then highlight results from the early 2025 iteration of the benchmark, including energy measurements of 40 widely used model architectures across 6 different tasks, case studies of how ML design choices impact energy consumption, and how automated optimization recommendations can lead to significant (sometimes more than 40%) energy savings without changing what is being computed by the model. The ML.ENERGY Benchmark is open-source and can be easily extended to various customized models and application scenarios.


Design tweaks promote responsible AI use for environmental protection, research shows

AIHub

Artificial intelligence systems that ask users to pause to consider AI's energy consumption and environmental impacts are likely to reduce unnecessary AI use, new research by Oregon State University suggests. The findings, published in Science Communication, are important as AI is already using electricity on scales that can be meaningfully compared to households, factories and towns. For example, the electricity needed to train a large language model would power 120 homes for a year, the researchers note; one AI-generated image has roughly the same energy cost as charging a smartphone. With about 85% of the world's energy still coming from fossil fuels, every megawatt-hour that can be carved from AI's electricity profile is significant, says the study's leader, Cheng "Chris" Chen of the OSU College of Liberal Arts. "Despite AI's substantial environmental impacts, information about those impacts is rarely disclosed or effectively communicated to everyday users of AI systems," said Chen, assistant professor in the School of Communication.


Ditch the niceties in AI prompts to save energy use, say researchers

New Scientist

ChatGPT now processes around 2.5 billion queries every day UN researchers are urging people to be less polite to artificial intelligences after a report found that cutting words from prompts could reduce ChatGPT's energy consumption by up to 25 per cent. Removing "please", "thank you" and other unnecessary words from AI prompts could save 87 to 98 gigawatt-hours of electricity per year, the report from the UN University Institute for Water, Environment and Health (UNU-INWEH) found. That is the equivalent of the annual residential electricity use of up to 760,000 people in sub-Saharan Africa. 'Flashes of brilliance and frustration': I let an AI agent run my day To reduce their energy consumption and carbon footprint, people should write concise prompts, avoid getting sucked into conversation loops and refrain from starting relationships with AI, the researchers said. "We are not saying be rude to your AI. But don't fall into the interaction trap and don't go falling in love with it either," says Kaveh Madani at UNU-INWEH.


CoNBONet: Conformalized Neuroscience-inspired Bayesian Operator Network for Reliability Analysis

arXiv.org Machine Learning

Time-dependent reliability analysis of nonlinear dynamical systems under stochastic excitations is a critical yet computationally demanding task. Conventional approaches, such as Monte Carlo simulation, necessitate repeated evaluations of computationally expensive numerical solvers, leading to significant computational bottlenecks. To address this challenge, we propose \textit{CoNBONet}, a neuroscience-inspired surrogate model that enables fast, energy-efficient, and uncertainty-aware reliability analysis, providing a scalable alternative to techniques such as Monte Carlo simulations. CoNBONet, short for \textbf{Co}nformalized \textbf{N}euroscience-inspired \textbf{B}ayesian \textbf{O}perator \textbf{Net}work, leverages the expressive power of deep operator networks while integrating neuroscience-inspired neuron models to achieve fast, low-power inference. Unlike traditional surrogates such as Gaussian processes, polynomial chaos expansions, or support vector regression, that may face scalability challenges for high-dimensional, time-dependent reliability problems, CoNBONet offers \textit{fast and energy-efficient inference} enabled by a neuroscience-inspired network architecture, \textit{calibrated uncertainty quantification with theoretical guarantees} via split conformal prediction, and \textit{strong generalization capability} through an operator-learning paradigm that maps input functions to system response trajectories. Validation of the proposed CoNBONet for various nonlinear dynamical systems demonstrates that CoNBONet preserves predictive fidelity, and achieves reliable coverage of failure probabilities, making it a powerful tool for robust and scalable reliability analysis in engineering design.


SpikedAttention: Training-Free and Fully Spike-Driven Transformer-to-SNN Conversion with Winner-Oriented Spike Shift for Softmax Operation

Neural Information Processing Systems

Event-driven spiking neural networks(SNNs) are promising neural networks that reduce the energy consumption of continuously growing AI models. Recently, keeping pace with the development of transformers, transformer-based SNNs were presented.


Prioritizing energy intelligence for sustainable growth

MIT Technology Review

As AI drives extraordinary power demands, energy intelligence is rapidly becoming a core business metric. Loudoun County, Virginia, once known for its pastoral scenery and proximity to Washington, DC, has earned a more modern reputation in recent years: The area has the highest concentration of data centers on the planet. Ten years ago, these facilities powered email and e-commerce. Today, thanks to the meteoric rise in demand for AI-infused everything, local utility Dominion Energy is working hard to keep pace with surging power demands. The pressure is so acute that Dulles International Airport is constructing the largest airport solar installation in the country, a highly visible bid to bolster the region's power mix. Data center campuses like Loudoun's are cropping up across the country to accommodate an insatiable appetite for AI.