AITopics | Energy

Collaborating Authors

Energy

Meet the Ethiopian entrepreneur who is reinventing ammonia production

MIT Technology ReviewSep-8-2025, 10:30:00 GMT

After growing up without reliable power at home, Iwnetim Abate is working to develop a steady supply of sustainable energy. "I'm the only one who wears glasses and has eye problems in the family," Iwnetim Abate says with a smile as sun streams in through the windows of his MIT office. "I think it's because of the candles." In the small town in Ethiopia where he grew up, Abate's family had electricity, but it was unreliable. So, for several days each week when they were without power, Abate would finish his homework by candlelight. Today, Abate, 32, is an assistant professor at MIT in the department of materials science and engineering.

abate, artificial intelligence, social media, (16 more...)

MIT Technology Review

Country:

Africa > Ethiopia (0.28)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.05)
North America > United States > Minnesota (0.05)
(5 more...)

Industry:

Materials > Chemicals > Industrial Gases (1.00)
Materials > Chemicals > Agricultural Chemicals (1.00)
Energy (1.00)
Materials > Chemicals > Commodity Chemicals > Ammonia (0.84)

Technology:

Information Technology > Communications > Social Media (0.99)
Information Technology > Artificial Intelligence (0.97)

Add feedback

Greener Deep Reinforcement Learning: Analysis of Energy and Carbon Efficiency Across Atari Benchmarks

Gardner, Jason, Dutta, Ayan, Roy, Swapnoneel, Kreidl, O. Patrick, Boloni, Ladislau

arXiv.org Artificial IntelligenceSep-8-2025

The growing computational demands of deep reinforcement learning (DRL) have raised concerns about the environmental and economic costs of training large-scale models. While algorithmic efficiency in terms of learning performance has been extensively studied, the energy requirements, greenhouse gas emissions, and monetary costs of DRL algorithms remain largely unexplored. In this work, we present a systematic benchmarking study of the energy consumption of seven state-of-the-art DRL algorithms, namely DQN, TRPO, A2C, ARS, PPO, RecurrentPPO, and QR-DQN, implemented using Stable Baselines. Each algorithm was trained for one million steps each on ten Atari 2600 games, and power consumption was measured in real-time to estimate total energy usage, CO2-Equivalent emissions, and electricity cost based on the U.S. national average electricity price. Our results reveal substantial variation in energy efficiency and training cost across algorithms, with some achieving comparable performance while consuming up to 24% less energy (ARS vs. DQN), emitting nearly 68% less CO2, and incurring almost 68% lower monetary cost (QR-DQN vs. RecurrentPPO) than less efficient counterparts. We further analyze the trade-offs between learning performance, training time, energy use, and financial cost, highlighting cases where algorithmic choices can mitigate environmental and economic impact without sacrificing learning performance. This study provides actionable insights for developing energy-aware and cost-efficient DRL practices and establishes a foundation for incorporating sustainability considerations into future algorithmic design and evaluation.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2509.05273

Country: North America > United States > Florida (0.46)

Genre: Research Report > New Finding (0.48)

Industry:

Energy > Power Industry (1.00)
Leisure & Entertainment > Games > Computer Games (0.69)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Kolmogorov-Arnold Network for Interpretable Cyberattack Detection in AGC Systems

Jilan, Jehad, Nambiar, Niranjana Naveen, Saber, Ahmad Mohammad, Paranjape, Alok, Youssef, Amr, Kundur, Deepa

arXiv.org Artificial IntelligenceSep-8-2025

Automatic Generation Control (AGC) is essential for power grid stability but remains vulnerable to stealthy cyberattacks, such as False Data Injection Attacks (FDIAs), which can disturb the system's stability while evading traditional detection methods. Unlike previous works that relied on black-box approaches, this work proposes Kolmogorov-Arnold Networks (KAN) as an interpretable and accurate method for FDIA detection in AGC systems, considering the system nonlinearities. KAN models include a method for extracting symbolic equations, and are thus able to provide more interpretability than the majority of machine learning models. The proposed KAN is trained offline to learn the complex nonlinear relationships between the AGC measurements under different operating scenarios. After training, symbolic formulas that describe the trained model's behavior can be extracted and leveraged, greatly enhancing interpretability. Our findings confirm that the proposed KAN model achieves FDIA detection rates of up to 95.97% and 95.9% for the initial model and the symbolic formula, respectively, with a low false alarm rate, offering a reliable approach to enhancing AGC cybersecurity.

artificial intelligence, kan model, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2509.05259

Country: North America > United States (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (1.00)
Energy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

MCANet: A Multi-Scale Class-Specific Attention Network for Multi-Label Post-Hurricane Damage Assessment using UAV Imagery

Liu, Zhangding, Mohammadi, Neda, Taylor, John E.

arXiv.org Artificial IntelligenceSep-8-2025

Rapid and accurate post-hurricane damage assessment is vital for disaster response and recovery. Yet existing CNN-based methods struggle to capture multi-scale spatial features and to distinguish visually similar or co-occurring damage types. To address these issues, we propose MCANet, a multi-label classification framework that learns multi-scale representations and adaptively attends to spatially relevant regions for each damage category. MCANet employs a Res2Net-based hierarchical backbone to enrich spatial context across scales and a multi-head class-specific residual attention module to enhance discrimination. Each attention branch focuses on different spatial granularities, balancing local detail with global context. We evaluate MCANet on the RescueNet dataset of 4,494 UAV images collected after Hurricane Michael. MCANet achieves a mean average precision (mAP) of 91.75%, outperforming ResNet, Res2Net, VGG, MobileNet, EfficientNet, and ViT. With eight attention heads, performance further improves to 92.35%, boosting average precision for challenging classes such as Road Blocked by over 6%. Class activation mapping confirms MCANet's ability to localize damage-relevant regions, supporting interpretability. Outputs from MCANet can inform post-disaster risk mapping, emergency routing, and digital twin-based disaster response. Future work could integrate disaster-specific knowledge graphs and multimodal large language models to improve adaptability to unseen disasters and enrich semantic understanding for real-world decision-making.

artificial intelligence, classification, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2509.04757

Country: North America > United States (1.00)

Genre: Research Report (0.82)

Industry:

Energy (0.48)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.86)

Add feedback

SCOP: Evaluating the Comprehension Process of Large Language Models from a Cognitive View

Xiao, Yongjie, Liang, Hongru, Qin, Peixin, Zhang, Yao, Lei, Wenqiang

arXiv.org Artificial IntelligenceSep-8-2025

Despite the great potential of large language models(LLMs) in machine comprehension, it is still disturbing to fully count on them in real-world scenarios. This is probably because there is no rational explanation for whether the comprehension process of LLMs is aligned with that of experts. In this paper, we propose SCOP to carefully examine how LLMs perform during the comprehension process from a cognitive view. Specifically, it is equipped with a systematical definition of five requisite skills during the comprehension process, a strict framework to construct testing data for these skills, and a detailed analysis of advanced open-sourced and closed-sourced LLMs using the testing data. With SCOP, we find that it is still challenging for LLMs to perform an expert-level comprehension process. Even so, we notice that LLMs share some similarities with experts, e.g., performing better at comprehending local information than global information. Further analysis reveals that LLMs can be somewhat unreliable -- they might reach correct answers through flawed comprehension processes. Based on SCOP, we suggest that one direction for improving LLMs is to focus more on the comprehension process, ensuring all comprehension skills are thoroughly developed during training.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2506.05

Country:

Asia (1.00)
Europe (0.67)
North America > United States > Florida > Miami-Dade County (0.46)
North America > United States > South Carolina > Greenville County (0.28)

Genre: Research Report > New Finding (0.67)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Energy-Weighted Flow Matching: Unlocking Continuous Normalizing Flows for Efficient and Scalable Boltzmann Sampling

Dern, Niclas, Redl, Lennart, Pfister, Sebastian, Kollovieh, Marcel, Lüdke, David, Günnemann, Stephan

arXiv.org Machine LearningSep-5-2025

Sampling from unnormalized target distributions, e.g. Boltzmann distributions $μ_{\text{target}}(x) \propto \exp(-E(x)/T)$, is fundamental to many scientific applications yet computationally challenging due to complex, high-dimensional energy landscapes. Existing approaches applying modern generative models to Boltzmann distributions either require large datasets of samples drawn from the target distribution or, when using only energy evaluations for training, cannot efficiently leverage the expressivity of advanced architectures like continuous normalizing flows that have shown promise for molecular sampling. To address these shortcomings, we introduce Energy-Weighted Flow Matching (EWFM), a novel training objective enabling continuous normalizing flows to model Boltzmann distributions using only energy function evaluations. Our objective reformulates conditional flow matching via importance sampling, allowing training with samples from arbitrary proposal distributions. Based on this objective, we develop two algorithms: iterative EWFM (iEWFM), which progressively refines proposals through iterative training, and annealed EWFM (aEWFM), which additionally incorporates temperature annealing for challenging energy landscapes. On benchmark systems, including challenging 55-particle Lennard-Jones clusters, our algorithms demonstrate sample quality competitive with state-of-the-art energy-only methods while requiring up to three orders of magnitude fewer energy evaluations.

artificial intelligence, evaluation, machine learning, (13 more...)

arXiv.org Machine Learning

2509.03726

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Promising Solution (0.46)

Industry: Energy (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series

Guo, Zhengyi, Li, Jiatu, Tang, Wenpin, Yao, David D.

arXiv.org Machine LearningSep-5-2025

This paper develops dimension reduction techniques for accelerating diffusion model inference in the context of synthetic data generation. The idea is to integrate compressed sensing into diffusion models: (i) compress the data into a latent space, (ii) train a diffusion model in the latent space, and (iii) apply a compressed sensing algorithm to the samples generated in the latent space, facilitating the efficiency of both model training and inference. Under suitable sparsity assumptions on data, the proposed algorithm is proved to enjoy faster convergence by combining diffusion model inference with sparse recovery. As a byproduct, we obtain an optimal value for the latent space dimension. We also conduct numerical experiments on a range of datasets, including image data (handwritten digits, medical images, and climate data) and financial time series for stress testing. Key words: Complexity, compressed sensing, diffusion models, inference time, signal recovery, sparsity.

artificial intelligence, diffusion model, machine learning, (19 more...)

arXiv.org Machine Learning

2509.03898

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > India > NCT > New Delhi (0.04)

Genre: Research Report (0.64)

Industry:

Energy (0.68)
Banking & Finance > Trading (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.46)

Add feedback

RAGuard: A Novel Approach for in-context Safe Retrieval Augmented Generation for LLMs

Walker, Connor, Aslansefat, Koorosh, Akram, Mohammad Naveed, Papadopoulos, Yiannis

arXiv.org Machine LearningSep-5-2025

Accuracy and safety are paramount in Offshore Wind (OSW) maintenance, yet conventional Large Language Models (LLMs) often fail when confronted with highly specialised or unexpected scenarios. We introduce RAGuard, an enhanced Retrieval-Augmented Generation (RAG) framework that explicitly integrates safety-critical documents alongside technical manuals.By issuing parallel queries to two indices and allocating separate retrieval budgets for knowledge and safety, RAGuard guarantees both technical depth and safety coverage. We further develop a SafetyClamp extension that fetches a larger candidate pool, "hard-clamping" exact slot guarantees to safety. We evaluate across sparse (BM25), dense (Dense Passage Retrieval) and hybrid retrieval paradigms, measuring Technical Recall@K and Safety Recall@K. Both proposed extensions of RAG show an increase in Safety Recall@K from almost 0\% in RAG to more than 50\% in RAGuard, while maintaining Technical Recall above 60\%. These results demonstrate that RAGuard and SafetyClamp have the potential to establish a new standard for integrating safety assurance into LLM-powered decision support in critical maintenance contexts.

large language model, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

2509.03768

Country: Europe > United Kingdom > England > East Yorkshire > Hull (0.04)

Genre:

Research Report > Promising Solution (0.50)
Overview > Innovation (0.40)
Research Report > New Finding (0.34)

Industry:

Information Technology (0.93)
Energy (0.89)
Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Integrated Wheel Sensor Communication using ESP32 -- A Contribution towards a Digital Twin of the Road System

Yordanov, Ventseslav, Schäfer, Simon, Mann, Alexander, Kowalewski, Stefan, Alrifaee, Bassam, Eckstein, Lutz

arXiv.org Artificial IntelligenceSep-5-2025

--While current onboard state estimation methods are adequate for most driving and safety-related applications, they do not provide insights into the interaction between tires and road surfaces. This paper explores a novel communication concept for efficiently transmitting integrated wheel sensor data from an ESP32 microcontroller . Our proposed approach utilizes a publish-subscribe system, surpassing comparable solutions in the literature regarding data transmission volume. We tested this approach on a drum tire test rig with our prototype sensors system utilizing a diverse selection of sample frequencies between 1 Hz and 32 000 Hzto demonstrate the efficacy of our communication concept. The implemented prototype sensor showcases minimal data loss, approximately 0. 1 % of the sampled data, validating the reliability of our developed communication system. This work contributes to advancing real-time data acquisition, providing insights into optimizing integrated wheel sensor communication. A. Motivation Intelligent transportation systems rely on sensors to estimate their own state and that of surrounding objects. Traditional onboard methods, using inertial measurement units (IMUs), wheel encoders, and Global Navigation Satellite Systems (GNSS), operate at frequencies from 1 Hz to several hundred Hz, providing sufficient accuracy for trajectory reconstruction and stability control. However, these methods do not directly capture tire-road interactions. Even safety systems like an-tilock brake (ABS) and electronic stability programs (ESP) rely on conservative assumptions about forces and friction rather than real-time estimation.

artificial intelligence, real time system, sensor, (15 more...)

arXiv.org Artificial Intelligence

2509.04061

Country: Europe (0.28)

Genre: Research Report (0.82)

Industry:

Transportation (1.00)
Energy (0.93)
Information Technology > Smart Houses & Appliances (0.70)
Automobiles & Trucks > Parts Supplier (0.46)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence (1.00)
Information Technology > Architecture > Real Time Systems (1.00)

Add feedback

MillGNN: Learning Multi-Scale Lead-Lag Dependencies for Multi-Variate Time Series Forecasting

Wu, Binqing, Shang, Zongjiang, Huang, Jianlong, Chen, Ling

arXiv.org Artificial IntelligenceSep-5-2025

Multi-variate time series (MTS) forecasting is crucial for various applications. Existing methods have shown promising results owing to their strong ability to capture intra- and inter-variate dependencies. However, these methods often overlook lead-lag dependencies at multiple grouping scales, failing to capture hierarchical lead-lag effects in complex systems. To this end, we propose MillGNN, a novel \underline{g}raph \underline{n}eural \underline{n}etwork-based method that learns \underline{m}ult\underline{i}ple grouping scale \underline{l}ead-\underline{l}ag dependencies for MTS forecasting, which can comprehensively capture lead-lag effects considering variate-wise and group-wise dynamics and decays. Specifically, MillGNN introduces two key innovations: (1) a scale-specific lead-lag graph learning module that integrates cross-correlation coefficients and dynamic decaying features derived from real-time inputs and time lags to learn lead-lag dependencies for each scale, which can model evolving lead-lag dependencies with statistical interpretability and data-driven flexibility; (2) a hierarchical lead-lag message passing module that passes lead-lag messages at multiple grouping scales in a structured way to simultaneously propagate intra- and inter-scale lead-lag effects, which can capture multi-scale lead-lag effects with a balance of comprehensiveness and efficiency. Experimental results on 11 datasets demonstrate the superiority of MillGNN for long-term and short-term MTS forecasting, compared with 16 state-of-the-art methods.

artificial intelligence, data mining, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2509.03852

Country: Asia > China > Zhejiang Province (0.28)

Genre: Research Report (1.00)

Industry: Energy > Power Industry (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Data Science > Data Mining (0.84)

Add feedback