AITopics

2506.14923

Country:

Europe (0.68)
North America > United States > California (0.46)

Genre: Research Report > New Finding (0.46)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Energy > Renewable > Geothermal > Geothermal Energy Systems and Facilities > Geothermal System for Power Generation > Enhanced Geothermal System (EGS) (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial Intelligence Aug-5-2025

Generative AI as a Pillar for Predicting 2D and 3D Wildfire Spread: Beyond Physics-Based Models and Traditional Deep Learning

Xu, Haowen, Zlatanova, Sisi, Liang, Ruiyu, Canbulat, Ismet

Wildfires increasingly threaten human life, ecosystems, and infrastructure, with events like the 2025 Palisades and Eaton fires in Los Angeles County underscoring the urgent need for more advanced prediction frameworks. Existing physics-based and deep learning models struggle to capture dynamic wildfire spread across both 2D and 3D domains, especially when incorporating real-time, multimodal geospatial data. This paper explores how generative Artificial Intelligence (AI) models, such as GANs, VAEs, and Transformers, can serve as transformative tools for wildfire prediction and simulation. These models offer superior capabilities in managing uncertainty, integrating multimodal inputs, and generating realistic, scalable wildfire scenarios. We introduce a new paradigm that leverages large language models (LLMs) for literature synthesis, classification, and knowledge extraction, conducting a systematic review of recent studies applying generative AI to fire prediction and monitoring. We highlight how generative approaches uniquely address challenges faced by traditional simulation and deep learning methods. Finally, we outline five key future directions for generative AI in wildfire management, including unified multimodal modeling of 2D and 3D dynamics, agentic AI systems and chatbots for decision intelligence, and real-time scenario generation on mobile devices, along with a discussion of critical challenges. Our findings advocate for a paradigm shift toward multimodal generative frameworks to support proactive, data-informed wildfire response.

machine learning, natural language, prediction, (18 more...)

2506.02485

Country: North America > United States > California > Los Angeles County (0.34)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Energy (1.00)
Health & Medicine (0.92)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Meng, Dekang, Haider, Rabab, van Hentenryck, Pascal

Flow-Aware GNN for Transmission Network Reconfiguration via Substation Breaker Optimization

arXiv.org Artificial Intelligence Aug-5-2025

This paper introduces OptiGridML, a machine learning framework for discrete topology optimization in power grids. The task involves selecting substation breaker configurations that maximize cross-region power exports, a problem typically formulated as a mixed-integer program (MIP) that is NP-hard and computationally intractable for large networks. OptiGridML replaces repeated MIP solves with a two-stage neural architecture: a line-graph neural network (LGNN) that approximates DC power flows for a given network topology, and a heterogeneous GNN (HeteroGNN) that predicts breaker states under structural and physical constraints. A physics-informed consistency loss connects these components by enforcing Kirchhoff's law on predicted flows. Experiments on synthetic networks with up to 1,000 breakers show that OptiGridML achieves power export improvements of up to 18% over baseline topologies, while reducing inference time from hours to milliseconds. These results demonstrate the potential of structured, flow-aware GNNs for accelerating combinatorial optimization in physical networked systems.
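The abstract does not give the form of the consistency loss, so the following is only a minimal sketch of one common way to penalize violations of Kirchhoff's current law on predicted line flows. The incidence-matrix convention, the function names, and the 3-bus toy network are all assumptions made for illustration, not details from the paper.

```python
import numpy as np

def kcl_residual(incidence, flows, injections):
    """Per-bus power imbalance: net injection minus net outflow on incident lines.

    incidence[b, l] is +1 if line l leaves bus b, -1 if it enters bus b, else 0
    (an assumed convention for this sketch).
    """
    return injections - incidence @ flows

def consistency_loss(incidence, flows, injections):
    """Mean squared KCL violation; zero when the flows balance at every bus."""
    residual = kcl_residual(incidence, flows, injections)
    return float(np.mean(residual ** 2))

# Hypothetical 3-bus network with lines 0->1, 1->2 and 0->2.
incidence = np.array([
    [ 1.0,  0.0,  1.0],
    [-1.0,  1.0,  0.0],
    [ 0.0, -1.0, -1.0],
])
injections = np.array([1.5, -0.5, -1.0])      # generation minus load per bus
balanced_flows = np.array([0.5, 0.0, 1.0])    # satisfies KCL at every bus
perturbed_flows = np.array([0.6, 0.0, 1.0])   # violates KCL at buses 0 and 1

print(consistency_loss(incidence, balanced_flows, injections))
print(consistency_loss(incidence, perturbed_flows, injections))
```

In a training loop, a term of this kind would be added to the supervised loss so that flow predictions that break power balance are penalized even when they are close to the labels.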

artificial intelligence, machine learning, substation, (16 more...)

2508.01951

Genre: Research Report (0.70)

Industry: Energy > Power Industry (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Das, Abhinav, Schlüter, Stephan

Regime-Aware Conditional Neural Processes with Multi-Criteria Decision Support for Operational Electricity Price Forecasting

arXiv.org Machine Learning Aug-4-2025

The energy market has undergone significant structural change in the past decade. The global drive for decarbonization is encouraging the use of renewable energy sources, thus affecting traditional supply-demand patterns, which were historically dominated by fossil fuels such as coal, oil, and natural gas [18]. The growing integration of renewable energy sources into the power supply increases uncertainty in the electricity market because sources such as wind and sunshine are intermittent [57]. The volatility of these generation sources causes large price shocks and regime changes that compromise financial stability as well as investment strategies in the power market [58]. Particularly in countries such as Germany, where the larger share of electricity is produced from renewable energy sources [37], levels of sunlight and wind affect electricity generation and thus prices. In addition to the physical problem of balancing the grid, this introduces non-stationarity into most price models, which makes their predictions less reliable. Accurate electricity price forecasting is crucial for efficient resource planning, financial risk management, and market stabilization, especially with increasing renewable energy penetration; it enables utilities, businesses, and governments to optimize planning and policy while matching demand and supply. Building an adequate prediction model that is relatively simple and understandable, yet reflects the complexity of the market and all the factors that influence it, is not straightforward. Authors have broadly used three types of models for prediction: statistical (probability-based) models [12], machine learning and deep learning models [42], and mixed models [30]. Precise forecasting allows market participants to make sound financial decisions.

artificial intelligence, machine learning, regime, (21 more...)

arXiv.org Machine Learning

2508.0004

Country:

Europe (1.00)
North America > United States > New York (0.28)

Genre:

Research Report > New Finding (0.67)
Research Report > Experimental Study (0.46)

Industry:

Energy > Power Industry (1.00)
Energy > Oil & Gas > Trading (0.67)
Energy > Renewable > Solar (0.46)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
(4 more...)

SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation

Ramkumar, Prerana

Generative Adversarial Networks (GANs) have achieved realistic super-resolution (SR) of images; however, they lack semantic consistency and per-pixel confidence, limiting their credibility in critical remote sensing applications such as disaster response, urban planning, and agriculture. This paper introduces Semantic and Uncertainty-Aware ESRGAN (SU-ESRGAN), the first SR framework designed for satellite imagery to integrate ESRGAN, a segmentation loss via DeepLabv3 for class detail preservation, and Monte Carlo dropout to produce pixel-wise uncertainty maps. SU-ESRGAN produces results (PSNR, SSIM, LPIPS) comparable to the baseline ESRGAN on aerial imagery. This novel model is valuable in satellite systems or UAVs that use wide field-of-view (FoV) cameras, trading off spatial resolution for coverage. The modular design allows integration in UAV data pipelines for on-board or post-processing SR to enhance imagery degraded by motion blur, compression, and sensor limitations. Further, the model is fine-tuned to evaluate its performance on cross-domain applications. The tests are conducted on two drone-based datasets that differ in altitude and imaging perspective. Performance evaluation of the fine-tuned models shows a stronger adaptation to the Aerial Maritime Drone Dataset, whose imaging characteristics align with the training data, highlighting the importance of domain-aware training in SR applications.
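As a rough illustration of how Monte Carlo dropout can yield a pixel-wise uncertainty map, the sketch below keeps dropout active at inference, runs several stochastic forward passes, and takes the per-pixel standard deviation. The tiny per-pixel network, its random weights, and all function names are stand-ins assumed for this example; they are not the SU-ESRGAN generator.

```python
import numpy as np

def stochastic_forward(image, w_hidden, w_out, drop_prob, rng):
    """One forward pass of a toy per-pixel network with dropout left ON."""
    hidden = np.maximum(image[..., None] * w_hidden, 0.0)   # shape (H, W, K)
    keep = rng.random(hidden.shape) >= drop_prob            # random dropout mask
    hidden = hidden * keep / (1.0 - drop_prob)              # inverted-dropout scaling
    return hidden @ w_out                                   # shape (H, W)

def mc_dropout(image, w_hidden, w_out, drop_prob=0.2, passes=50, seed=0):
    """Return the per-pixel mean prediction and standard deviation over passes."""
    rng = np.random.default_rng(seed)
    samples = np.stack([
        stochastic_forward(image, w_hidden, w_out, drop_prob, rng)
        for _ in range(passes)
    ])
    return samples.mean(axis=0), samples.std(axis=0)

# Hypothetical inputs: an 8x8 "image" and random weights for 16 hidden units.
init_rng = np.random.default_rng(1)
image = init_rng.random((8, 8))
w_hidden = init_rng.normal(size=16)
w_out = init_rng.normal(size=16)

mean_map, uncertainty_map = mc_dropout(image, w_hidden, w_out)
print(mean_map.shape, uncertainty_map.shape)
```

Pixels where the stochastic passes disagree get a larger standard deviation, which is the sense in which the map expresses per-pixel confidence.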

artificial intelligence, esrgan, machine learning, (14 more...)

2508.0075

Country: Asia > Middle East > UAE (0.15)

Genre: Research Report (1.00)

Industry:

Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.56)
Food & Agriculture > Agriculture (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.51)

Wind Power Scenario Generation based on the Generalized Dynamic Factor Model and Generative Adversarial Network

Cho, Young-ho, Zhu, Hao, Lee, Duehee, Baldick, Ross

For conducting resource adequacy studies, we synthesize multiple long-term wind power scenarios of distributed wind farms simultaneously by using the spatio-temporal features: spatial and temporal correlation, waveforms, marginal and ramp-rate distributions of waveforms, power spectral densities, and statistical characteristics. Generating the spatial correlation in scenarios requires the design of common factors for neighboring wind farms and antithetical factors for distant wind farms. The generalized dynamic factor model (GDFM) can extract the common factors through cross spectral density analysis, but it cannot closely imitate waveforms. The GAN can synthesize plausible samples representing the temporal correlation by verifying samples through a fake sample discriminator. To combine the advantages of GDFM and GAN, we use the GAN to provide a filter that extracts dynamic factors with temporal information from the observation data, and we then apply this filter in the GDFM to represent both spatial and frequency correlations of plausible waveforms. Numerical tests on the combination of GDFM and GAN have demonstrated performance improvements over competing alternatives in synthesizing wind power scenarios from Australia, better realizing plausible statistical characteristics of actual wind power compared to alternatives such as the GDFM with a filter synthesized from distributions of actual dynamic filters and the GAN with direct synthesis without dynamic factors.
Resource adequacy means maintaining power system reliability by having sufficient capacity such that, even with failures or variability of resources, the probability of not being able to meet all load is sufficiently small [1]. System operators achieve resource adequacy of a power system by ensuring there is enough generation capacity [2]. In the case of intermittent energy resources, the effective load carrying capacity (ELCC) of the intermittent resource is the equivalent capacity of highly reliable generators that would result in the same probability of not being able to meet all load [3]. For example, the ELCC of wind power can be obtained by simulating power systems with long-term wind power scenarios with realistic ramping rates and marginal distributions [4]. Furthermore, the capacity factor and reserve margin contribution of wind power to the power system reliability can also be obtained by simulating a future power system by using realistic long-term wind power scenarios [5].

artificial intelligence, machine learning, scenario, (16 more...)

2508.00692

Country: Oceania > Australia (0.24)

Genre: Research Report (0.64)

Industry: Energy > Renewable > Wind (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Tresson, Paul, Coz, Pierre Le, Tulet, Hadrien, Malkassian, Anthony, Méchain, Maxime Réjou

IAMAP: Unlocking Deep Learning in QGIS for non-coders and limited computing resources

Remote sensing has entered a new era with the rapid development of artificial intelligence approaches. However, the implementation of deep learning has largely remained restricted to specialists and has been impractical because it often requires (i) large reference datasets for model training and validation; (ii) substantial computing resources; and (iii) strong coding skills. Here, we introduce IAMAP, a user-friendly QGIS plugin that addresses these three challenges in an easy yet flexible way. IAMAP builds on recent advancements in self-supervised learning strategies, which now provide robust feature extractors, often referred to as foundation models. These generalist models can often be reliably used in few-shot or zero-shot scenarios (i.e., with little to no fine-tuning). IAMAP's interface allows users to streamline several key steps in remote sensing image analysis: (i) extracting image features using a wide range of deep learning architectures; (ii) reducing dimensionality with built-in algorithms; (iii) performing clustering on features or their reduced representations; (iv) generating feature similarity maps; and (v) calibrating and validating supervised machine learning models for prediction. By enabling non-AI specialists to leverage the high-quality features provided by recent deep learning approaches without requiring GPU capacity or extensive reference datasets, IAMAP contributes to the democratization of computationally efficient and energy-conscious deep learning methods.
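The feature-extraction, dimensionality-reduction, and clustering steps listed above can be sketched in a few lines. This is a NumPy-only illustration under stated assumptions: the synthetic `features` array stands in for embeddings from a pretrained model, PCA and k-means are chosen here as familiar examples, and none of the function names come from the IAMAP plugin itself.

```python
import numpy as np

def pca_reduce(features, n_components):
    """Project features onto their top principal components (via SVD)."""
    centered = features - features.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T

def farthest_point_init(points, k):
    """Deterministic seeding: start at the first point, then repeatedly
    add the point farthest from all centers chosen so far."""
    centers = [points[0]]
    for _ in range(k - 1):
        dists = np.min([np.linalg.norm(points - c, axis=1) for c in centers], axis=0)
        centers.append(points[dists.argmax()])
    return np.array(centers, dtype=float)

def kmeans(points, k, iters=20):
    """Plain Lloyd iterations; returns one cluster label per point."""
    centers = farthest_point_init(points, k)
    for _ in range(iters):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = points[labels == j].mean(axis=0)
    return labels

# Stand-in for per-patch deep features: two well-separated groups in 16-D.
rng = np.random.default_rng(0)
features = np.vstack([
    rng.normal(0.0, 1.0, size=(30, 16)),
    rng.normal(10.0, 1.0, size=(30, 16)),
])

reduced = pca_reduce(features, n_components=2)
labels = kmeans(reduced, k=2)
print(reduced.shape, np.bincount(labels))
```

Clustering on the reduced representation rather than the raw features keeps the distance computations cheap, which matters when no GPU is available.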

artificial intelligence, machine learning, plugin, (13 more...)

2508.00627

Country:

Asia > Thailand (0.15)
Europe > France (0.14)

Genre: Research Report (0.64)

Industry: Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.58)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Roux, Quentin Le, Teglia, Yannick, Furon, Teddy, Loubet-Moundi, Philippe

Backdoor Attacks on Deep Learning Face Detection

Face Recognition Systems that operate in unconstrained environments capture images under varying conditions, such as inconsistent lighting or diverse face poses. These challenges require including a Face Detection module that regresses bounding boxes and landmark coordinates for proper Face Alignment. This paper shows the effectiveness of Object Generation Attacks on Face Detection, dubbed Face Generation Attacks, and demonstrates for the first time a Landmark Shift Attack that backdoors the coordinate regression task performed by face detectors. We then offer mitigations against these vulnerabilities. Deep Neural Networks (DNNs) have considerably influenced both academic research and a wide range of industries. The rapid growth in computational power and dataset availability has led to large-scale Machine Learning applications, such as anomaly detection in server farms and power plants [1], [2]. This technological change has also transformed Face Recognition, with modern Face Recognition Systems (FRSs) increasingly leveraging DNNs, e.g., to secure access to sensitive facilities [3]. Developing Machine Learning pipelines requires a costly combination of domain expertise, computational resources, and data access. The first casualty of these rising Machine Learning demands is often security.

artificial intelligence, landmark shift attack, machine learning, (11 more...)

2508.0062

Country: Europe > France (0.14)

Genre:

Research Report (0.65)
Overview (0.46)

Industry:

Information Technology > Security & Privacy (1.00)
Energy (0.88)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Plotas, Konstantinos, Papadakis, Emmanouil, Drosakis, Drosakis, Trahanias, Panos, Papageorgiou, Dimitrios

A control scheme for collaborative object transportation between a human and a quadruped robot using the MIGHTY suction cup

Please find the citation info @ Zenodo, as the proceedings of ICRA are no longer sent to IEEE Xplore. This is a pre-print version of the paper presented at IEEE International Conference on Robotics and Automation 2025 (ICRA), Atlanta, US. Abstract: In this work, a control scheme for human-robot collaborative object transportation is proposed, considering a quadruped robot equipped with the MIGHTY suction cup that serves both as a gripper for holding the object and a force/torque sensor. The proposed control scheme is based on the notion of admittance control, and incorporates a variable damping term aimed at increasing the controllability of the human and, at the same time, decreasing her/his effort. Furthermore, to ensure that the object is not detached from the suction cup during the collaboration, an additional control signal is proposed, which is based on a barrier artificial potential. The proposed control scheme is proven to be passive and its performance is demonstrated through experimental evaluations conducted using the Unitree Go1 robot equipped with the MIGHTY suction cup.
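To make the idea of admittance control with variable damping concrete, here is a one-dimensional sketch. The abstract does not state the damping law, so the switching rule below (low damping when the human pushes along the direction of motion, high damping otherwise) is a hypothetical choice, as are the mass, the damping bounds, the time step, and every function name. The barrier-potential term is not modelled.

```python
def variable_damping(force, velocity, d_min=10.0, d_max=40.0):
    """Hypothetical rule: low damping when the applied force is aligned with
    the current velocity, high damping when it opposes motion or at rest."""
    return d_min if force * velocity > 0.0 else d_max

def admittance_step(velocity, force, mass=5.0, dt=0.01):
    """One explicit-Euler step of  mass * dv/dt + d(v, F) * v = F."""
    damping = variable_damping(force, velocity)
    return velocity + dt * (force - damping * velocity) / mass

def simulate(force, steps, velocity=0.0):
    """Apply a constant human force for a number of steps; return final velocity."""
    for _ in range(steps):
        velocity = admittance_step(velocity, force)
    return velocity

# A constant 10 N push for 20 s of simulated time (assumed SI units).
final_velocity = simulate(force=10.0, steps=2000)
print(final_velocity)
```

Under these assumed values the commanded velocity settles near `force / d_min`, so lowering the damping while the human is pushing with the motion reduces the force needed to sustain a given speed.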

artificial intelligence, robot, suction cup, (17 more...)

doi: 10.5281/zenodo.16621109

2508.00584

Country: Europe (0.28)

Genre: Research Report (0.50)

Industry:

Health & Medicine (0.68)
Energy (0.68)

Technology: Information Technology > Artificial Intelligence > Robots > Locomotion (0.87)

Rani, Anju, Ortiz-Arroyo, Daniel, Durdevic, Petar

CLIPTime: Time-Aware Multimodal Representation Learning from Images and Text

Understanding the temporal dynamics of biological growth is critical across diverse fields such as microbiology, agriculture, and biodegradation research. Although vision-language models like Contrastive Language Image Pretraining (CLIP) have shown strong capabilities in joint visual-textual reasoning, their effectiveness in capturing temporal progression remains limited. To address this, we propose CLIPTime, a multimodal, multitask framework designed to predict both the developmental stage and the corresponding timestamp of fungal growth from image and text inputs. Built upon the CLIP architecture, our model learns joint visual-textual embeddings and enables time-aware inference without requiring explicit temporal input during testing. To facilitate training and evaluation, we introduce a synthetic fungal growth dataset annotated with aligned timestamps and categorical stage labels. CLIPTime jointly performs classification and regression, predicting discrete growth stages alongside continuous timestamps. We also propose custom evaluation metrics, including temporal accuracy and regression error, to assess the precision of time-aware predictions. Experimental results demonstrate that CLIPTime effectively models biological progression and produces interpretable, temporally grounded outputs, highlighting the potential of vision-language models in real-world biological monitoring applications.
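Joint classification and regression of this kind is often trained with a weighted sum of two losses. The sketch below assumes softmax cross-entropy for the stage label and mean squared error for the timestamp, combined with a hypothetical weight `reg_weight`; the abstract does not specify the actual loss, so treat this as one plausible formulation rather than the CLIPTime objective.

```python
import numpy as np

def softmax_cross_entropy(logits, labels):
    """Mean cross-entropy between softmax(logits) and integer class labels."""
    shifted = logits - logits.max(axis=1, keepdims=True)   # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-log_probs[np.arange(len(labels)), labels].mean())

def multitask_loss(stage_logits, stage_labels, time_pred, time_true, reg_weight=1.0):
    """Return (total, classification, regression) losses for one batch."""
    cls_loss = softmax_cross_entropy(stage_logits, stage_labels)
    reg_loss = float(np.mean((time_pred - time_true) ** 2))
    return cls_loss + reg_weight * reg_loss, cls_loss, reg_loss

# Hypothetical batch: 4 samples, 3 growth stages, timestamps in hours.
stage_logits = np.zeros((4, 3))                 # uninformative stage predictions
stage_labels = np.array([0, 1, 2, 1])
time_true = np.array([2.0, 10.0, 30.0, 12.0])
time_pred = time_true.copy()                    # perfect timestamp predictions

total, cls_loss, reg_loss = multitask_loss(stage_logits, stage_labels, time_pred, time_true)
print(total, cls_loss, reg_loss)
```

With uniform logits over three classes the classification term equals ln 3, and the regression term is zero because the timestamps match, so the example isolates each component of the sum.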

large language model, machine learning, natural language, (16 more...)

2508.00447

Country: Europe > Denmark (0.15)

Genre: Research Report > New Finding (0.48)

Industry:

Energy (0.69)
Health & Medicine > Pharmaceuticals & Biotechnology (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)