energy optimization
MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection
Huang, Zhengxiang, Niu, Chaoyue, Wang, Zhaode, Xue, Jiarui, Zhang, Hanming, Wang, Yugang, Xin, Zewei, Jiang, Xiaotang, Lv, Chengfei, Wu, Fan, Chen, Guihai
As the demand for on-device Large Language Model (LLM) inference grows, energy efficiency has become a major concern, especially for battery-limited mobile devices. Our analysis shows that the memory-bound LLM decode phase dominates energy use, and yet most existing works focus on accelerating the prefill phase, neglecting energy concerns. We introduce Adaptive Energy-Centric Core Selection (AECS) and integrate it into MNN to create the energy-efficient version, MNN-AECS, the first engine-level system solution without requiring root access or OS modifications for energy-efficient LLM decoding. MNN-AECS is designed to reduce LLM decoding energy while keeping decode speed within an acceptable slowdown threshold by dynamically selecting low-power CPU cores. MNN-AECS is evaluated across 5 Android and 2 iOS devices on 5 popular LLMs of various sizes. Compared to original MNN, MNN-AECS cuts down energy use by 23% without slowdown averaged over all 7 devices and 4 datasets. Against other engines, including llama.cpp, executorch, mllm, and MediaPipe, MNN-AECS delivers 39% to 78% energy saving and 12% to 363% speedup on average.
- Asia > China > Shanghai > Shanghai (0.05)
- North America > United States > District of Columbia > Washington (0.05)
- North America > United States > New York > New York County > New York City (0.05)
- (11 more...)
Energy Optimized Piecewise Polynomial Approximation Utilizing Modern Machine Learning Optimizers
Waclawek, Hannes, Huber, Stefan
This work explores an extension of ML-optimized piecewise polynomial approximation by incorporating energy optimization as an additional objective. Traditional closed-form solutions enable continuity and approximation targets but lack flexibility in accommodating complex optimization goals. By leveraging modern gradient descent optimizers within TensorFlow, we introduce a framework that minimizes total curvature in cam profiles, leading to smoother motion and reduced energy consumption for input data that is unfavorable for sole approximation and continuity optimization. Experimental results confirm the effectiveness of this approach, demonstrating its potential to improve efficiency in scenarios where input data is noisy or suboptimal for conventional methods.
Toward Cross-Layer Energy Optimizations in Machine Learning Systems
Chung, Jae-Won, Chowdhury, Mosharaf
The enormous energy consumption of machine learning (ML) and generative AI workloads shows no sign of waning, taking a toll on operating costs, power delivery, and environmental sustainability. Despite a long line of research on energy-efficient hardware, we found that software plays a critical role in ML energy optimization through two recent works: Zeus and Perseus. This is especially true for large language models (LLMs) because their model sizes and, therefore, energy demands are growing faster than hardware efficiency improvements. Therefore, we advocate for a cross-layer approach for energy optimizations in ML systems, where hardware provides architectural support that pushes energy-efficient software further, while software leverages and abstracts the hardware to develop techniques that bring hardware-agnostic energy-efficiency gains.
- North America > United States > Michigan (0.04)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Energy Optimization for HVAC Systems in Multi-VAV Open Offices: A Deep Reinforcement Learning Approach
Wang, Hao, Chen, Xiwen, Vital, Natan, Duffy, Edward., Razi, Abolfazl
With more than 32% of the global energy used by commercial and residential buildings, there is an urgent need to revisit traditional approaches to Building Energy Management (BEM). With HVAC systems accounting for about 40% of the total energy cost in the commercial sector, we propose a low-complexity DRL-based model with multi-input multi-output architecture for the HVAC energy optimization of open-plan offices, which uses only a handful of controllable and accessible factors. The efficacy of our solution is evaluated through extensive analysis of the overall energy consumption and thermal comfort levels compared to a baseline system based on the existing HVAC schedule in a real building. This comparison shows that our method achieves 37% savings in energy consumption with minimum violation (<1%) of the desired temperature range during work hours. It takes only a total of 40 minutes for 5 epochs (about 7.75 minutes per epoch) to train a network with superior performance and covering diverse conditions for its low-complexity architecture; therefore, it easily adapts to changes in the building setups, weather conditions, occupancy rate, etc. Moreover, by enforcing smoothness on the control strategy, we suppress the frequent and unpleasant on/off transitions on HVAC units to avoid occupant discomfort and potential damage to the system. The generalizability of our model is verified by applying it to different building models and under various weather conditions.
- North America > United States (1.00)
- Asia (0.28)
- Overview (1.00)
- Research Report > New Finding (0.67)
- Construction & Engineering > HVAC (1.00)
- Government > Regional Government > North America Government > United States Government (0.67)
- Energy > Oil & Gas > Upstream (0.46)
SEO: Safety-Aware Energy Optimization Framework for Multi-Sensor Neural Controllers at the Edge
Odema, Mohanad, Ferlez, James, Shoukry, Yasser, Faruque, Mohammad Abdullah Al
Runtime energy management has become quintessential for multi-sensor autonomous systems at the edge for achieving high performance given the platform constraints. Typical for such systems, however, is to have their controllers designed with formal guarantees on safety that precede in priority such optimizations, which in turn limits their application in real settings. In this paper, we propose a novel energy optimization framework that is aware of the autonomous system's safety state, and leverages it to regulate the application of energy optimization methods so that the system's formal safety properties are preserved. In particular, through the formal characterization of a system's safety state as a dynamic processing deadline, the computing workloads of the underlying models can be adapted accordingly. For our experiments, we model two popular runtime energy optimization methods, offloading and gating, and simulate an autonomous driving system (ADS) use-case in the CARLA simulation environment with performance characterizations obtained from the standard Nvidia Drive PX2 ADS platform. Our results demonstrate that through a formal awareness of the perceived risks in the test case scenario, energy efficiency gains are still achieved (reaching 89.9%) while maintaining the desired safety properties.
- Energy (1.00)
- Information Technology (0.70)
The Future of Mobility, Fuelled by Artificial Intelligence and Distributed Ledger Technology
What opportunities exist for AI and Distributed Ledger Technology in Mobility and how can emerging companies capture the short and long term value? The digital transformation in the Automotive Industry is creating more data than ever before. According to a study conducted by McKinsey, the value pool of car-data-monetization could be as large as $750 billion by 2030. According to the study, the opportunity for auto manufacturers hinges on their ability to 1) quickly build and test automotive data-driven products and services and 2) develop new business models built on technological innovation, advanced capabilities, and partnerships that push the boundaries of the automotive industry. Given these two goals, auto manufacturers will be creating a myriad of opportunities for technology developers, startups, insurance providers, data management servers and many more stakeholders.
- Information Technology > Security & Privacy (1.00)
- Automobiles & Trucks > Manufacturer (1.00)
- Information Technology > Services > e-Commerce Services (0.64)
- Transportation > Ground > Road (0.49)
The Future of Mobility, Fuelled by Artificial Intelligence and Distributed Ledger Technology
What opportunities exist for AI and Distributed Ledger Technology in Mobility and how can emerging companies capture the short and long term value? The digital transformation in the Automotive Industry is creating more data than ever before. According to a study conducted by McKinsey, research on car-data-monetization suggests that this value pool could be as large as $750 billion by 2030. According to the study, the opportunity for auto manufacturers hinges on their ability to 1) quickly build and test automotive data-driven products and services and 2) develop new business models built on technological innovation, advanced capabilities, and partnerships that push the boundaries of the automotive industry. Given these two goals, auto manufacturers will be creating a myriad of opportunities for technology developers, startups, insurance providers, data management servers and many more stakeholders.
- Automobiles & Trucks > Manufacturer (1.00)
- Information Technology > Services > e-Commerce Services (0.72)
- Transportation > Ground > Road (0.50)
Optimal Planning Strategy for Ambush Avoidance
Boidot, Emmanuel (Georgia Institute of Technology) | Marzuoli, Aude (Georgia Institute of Technology) | Feron, Eric (Georgia Institute of Technology)
Operating vehicles in adversarial environments between a recurring origin-destination pair requires new planning techniques. Such a technique, presented in this paper, is a game inspired by Ruckle’s original contribution. The goal of the first player is to minimize the expected casualties undergone by a moving agent. The goal of the second player is to maximize this damage. The outcome of the game is obtained via a linear program that solves the corresponding minmax optimization problem over this outcome. The formulation originally proposed by Feron and Joseph is extended to different environment models in order to compute routing strategies over unstructured environments. To compare these methods for increasingly accurate representations of the environment, a grid-based model is chosen to represent the environment and the existence of a sufficient network size is highlighted. A global framework for the generation of realistic routing strategies between any two points is described. Finally the practicality of the proposed framework is illustrated on real world environments.
- Europe > Monaco (0.05)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
A Complete framework for ambush avoidance in realistic environments
Boidot, Emmanuel, Marzuoli, Aude, Feron, Eric
Operating vehicles in adversarial environments between a recurring origin-destination pair requires new planning techniques. A two players zero-sum game is introduced. The goal of the first player is to minimize the expected casualties undergone by a convoy. The goal of the second player is to maximize this damage. The outcome of the game is obtained via a linear program that solves the corresponding minmax optimization problem over this outcome. Different environment models are defined in order to compute routing strategies over unstructured environments. To compare these methods for increasingly accurate representations of the environment, a grid-based model is chosen to represent the environment and the existence of a sufficient network size is highlighted. A global framework for the generation of realistic routing strategies between any two points is described. This framework requires a good assessment of the potential casualties at any location, therefore the most important parameters are identified. Finally the framework is tested on real world environments.
- Europe > Monaco (0.05)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- North America > United States > California > Los Angeles County > Pasadena (0.04)
- Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)