Goto

Collaborating Authors

 Energy


Distributed games with jumps: An $α$-potential game approach

arXiv.org Artificial Intelligence

Motivated by game-theoretic models of crowd motion dynamics, this paper analyzes a broad class of distributed games with jump diffusions within the recently developed $α$-potential game framework. We demonstrate that analyzing the $α$-Nash equilibria reduces to solving a finite-dimensional control problem. Beyond the viscosity and verification characterizations for the general games, we explicitly and in detail examine how spatial population distributions and interaction rules influence the structure of $α$-Nash equilibria in these distributed settings, and in particular for crowd motion games. Our theoretical results are supported by numerical implementations using policy gradient-based algorithms, further demonstrating the computational advantages of the $α$-potential game framework in computing Nash equilibria for general dynamic games.


Semantically-Guided Inference for Conditional Diffusion Models: Enhancing Covariate Consistency in Time Series Forecasting

arXiv.org Artificial Intelligence

Diffusion models have demonstrated strong performance in time series forecasting, yet often suffer from semantic misalignment between generated trajectories and conditioning covariates, especially under complex or multimodal conditions. To address this issue, we propose SemGuide, a plug-and-play, inference-time method that enhances covariate consistency in conditional diffusion models. Our approach introduces a scoring network to assess the semantic alignment between intermediate diffusion states and future covariates. These scores serve as proxy likelihoods in a stepwise importance reweighting procedure, which progressively adjusts the sampling path without altering the original training process. The method is model-agnostic and compatible with any conditional diffusion framework. Experiments on real-world forecasting tasks show consistent gains in both predictive accuracy and covariate alignment, with especially strong performance under complex conditioning scenarios.


Energy-Predictive Planning for Optimizing Drone Service Delivery

arXiv.org Artificial Intelligence

Energy-Predictive Planning for Optimizing Drone Service Delivery Guanting Ren, Babar Shahzaad, Balsam Alkouz, Abdallah Lakhdari, Ath-man Bouguettaya An Energy-Predictive Drone Service (EPDS) framework to minimize the average delivery time. A heuristic-based optimization for drone services composition to reduce recharging and waiting time. Abstract We propose a novel Energy-Predictive Drone Service (EPDS) framework for efficient package delivery within a skyway network. The EPDS framework incorporates a formal modeling of an EPDS and an adaptive bidirectional Long Short-Term Memory (Bi-LSTM) machine learning model. This model predicts the energy status and stochastic arrival times of other drones operating in the same skyway network. Leveraging these predictions, we develop a heuristic optimization approach for composite drone services. This approach identifies the most time-efficient and energy-efficient skyway path and recharging schedule for each drone in the network. We conduct extensive experiments using a real-world drone flight dataset to evaluate the performance of the proposed framework. Introduction The Internet of Things (IoT) has become more mature and widespread, largely thanks to advancements in software and hardware technologies. Drones serve various purposes, including aiding in farm irrigation, capturing aerial imagery for entertainment, and facilitating the delivery of retail goods (Mohsan et al. (2023)). Drone delivery services are increasingly important because they can offer faster delivery times, lower operational costs, and potentially a greener alternative to traditional delivery methods (Eskandaripour and Boldsaikhan (2023)). Several key challenges, however, hinder the wider adoption of drones for delivery services (Sah et al. (2021)). A primary challenge is constrained battery capacity, which limits a drone's flight range (Huang et al. (2022)). With current lightweight batteries, delivery drones are not well-suited for long-distance trips, particularly when carrying heavy payloads. As a result, some studies propose using drones only for last-mile deliveries (Garg et al. (2023)). Despite these limitations, drones remain a clean, cost-effective, and ubiquitous alternative to land-based delivery in both urban and rural areas (Attenni et al. (2023)).


Referring Remote Sensing Image Segmentation with Cross-view Semantics Interaction Network

arXiv.org Artificial Intelligence

Recently, Referring Remote Sensing Image Segmentation (RRSIS) has aroused wide attention. To handle drastic scale variation of remote targets, existing methods only use the full image as input and nest the saliency-preferring techniques of cross-scale information interaction into traditional single-view structure. Although effective for visually salient targets, they still struggle in handling tiny, ambiguous ones in lots of real scenarios. In this work, we instead propose a paralleled yet unified segmentation framework Cross-view Semantics Interaction Network (CSINet) to solve the limitations. Motivated by human behavior in observing targets of interest, the network orchestrates visual cues from remote and close distances to conduct synergistic prediction. In its every encoding stage, a Cross-View Window-attention module (CVWin) is utilized to supplement global and local semantics into close-view and remote-view branch features, finally promoting the unified representation of feature in every encoding stage. In addition, we develop a Collaboratively Dilated Attention enhanced Decoder (CDAD) to mine the orientation property of target and meanwhile integrate cross-view multiscale features. The proposed network seamlessly enhances the exploitation of global and local semantics, achieving significant improvements over others while maintaining satisfactory speed.


GraphVSSM: Graph Variational State-Space Model for Probabilistic Spatiotemporal Inference of Dynamic Exposure and Vulnerability for Regional Disaster Resilience Assessment

arXiv.org Artificial Intelligence

Regional disaster resilience quantifies the changing nature of physical risks to inform policy instruments ranging from local immediate recovery to international sustainable development. While many existing state-of-practice methods have greatly advanced the dynamic mapping of exposure and hazard, our understanding of large-scale physical vulnerability has remained static, costly, limited, region-specific, coarse-grained, overly aggregated, and inadequately calibrated. With the significant growth in the availability of time-series satellite imagery and derived products for exposure and hazard, we focus our work on the equally important yet challenging element of the risk equation: physical vulnerability. We leverage machine learning methods that flexibly capture spatial contextual relationships, limited temporal observations, and uncertainty in a unified probabilistic spatiotemporal inference framework. We therefore introduce Graph Variational State-Space Model (GraphVSSM), a novel modular spatiotemporal approach that uniquely integrates graph deep learning, state-space modeling, and variational inference using time-series data and prior expert belief systems in a weakly supervised or coarse-to-fine-grained manner. We present three major results: a city-wide demonstration in Quezon City, Philippines; an investigation of sudden changes in the cyclone-impacted coastal Khurushkul community (Bangladesh) and mudslide-affected Freetown (Sierra Leone); and an open geospatial dataset, METEOR 2.5D, that spatiotemporally enhances the existing global static dataset for UN Least Developed Countries (2020). Beyond advancing regional disaster resilience assessment and improving our understanding global disaster risk reduction progress, our method also offers a probabilistic deep learning approach, contributing to broader urban studies that require compositional data analysis in weak supervision.


PHM-Bench: A Domain-Specific Benchmarking Framework for Systematic Evaluation of Large Models in Prognostics and Health Management

arXiv.org Artificial Intelligence

With the rapid advancement of generative artificial intelligence, large language models (LLMs) are increasingly adopted in industrial domains, offering new opportunities for Prognostics and Health Management (PHM), addressing challenges such as high development costs, long deployment cycles, and limited generalizability. However, despite the grow ing synergy between PHM and LLM s, existing evaluation methodologies often fall short regarding structural completeness, dimensional comprehensiveness, and evaluatio n granularity, severely hampering the in - depth integration of LLMs into the PHM domain. To address these limitations, this study, drawing upon two decades of PHM research and recent advancements in LLM - driven PHM systems, proposes PHM - Bench, a novel three - dimensional evaluation framework for PHM - oriented large models. Grounded in the triadic structure of fundamental capabilit y, core task, and entire lifecycle, PHM - Bench is designed specifically for the unique demands of PHM system engineering. It systematically defines multi - level evaluation metrics spanning knowledge comprehension, algorithmi c generation, task optimization, etc., aligning with typical PHM tasks including condition monitoring, fault diagnosis, fault & RUL prediction, and maintenance decision - making, thus establishing a comprehensive assessment mechanism, bridg ing complex engineering systems' design, development, and operational stages. Utilizing both self - constructed case sets and publicly available industrial dataset s, PHM - Bench enables multi - dimensional evaluation of general - purpose and domain - specific models across diverse PHM tasks. Experiments demonstrate its effectiveness in revealing model capabilities and limitations, distinguishing performance across tasks, and providing a unified baseline for model development and optimization. PHM - Bench lays the methodological foundation for industrial - scale assessment of LLMs in PHM and offers a critical benchmark to guide the transition from general - purpose to PHM - specialized models.


NMS: Efficient Edge DNN Training via Near-Memory Sampling on Manifolds

arXiv.org Artificial Intelligence

--Training deep neural networks (DNNs) on edge devices has attracted increasing attention due to its potential to address challenges related to domain adaptation and privacy preservation. However, DNNs typically rely on large datasets for training, which results in substantial energy consumption, making the training in edge devices impractical. Some dataset compression methods have been proposed to solve this challenge. For instance, the coreset selection and dataset distillation reduce the training cost by selecting and generating representative samples respectively. Nevertheless, these methods have two significant defects: (1) The necessary of leveraging a DNN model to evaluate the quality of representative samples, which inevitably introduces inductive bias of DNN, resulting in a severe generalization issue; (2) All training images require multiple accesses to the DDR via long-distance PCB connections, leading to substantial energy overhead. T o address these issues, inspired by the nonlinear manifold stationary of the human brain, we firstly propose a DNN-free sample-selecting algorithm, called DE-SNE, to improve the generalization issue. Secondly, we innovatively utilize the near-memory computing technique to implement DE-SNE, thus only a small fraction of images need to access the DDR via long-distance PCB. It significantly reduces DDR energy consumption. As a result, we build a novel expedited DNN training system with a more efficient in-place Near-Memory Sampling characteristic for edge devices, dubbed NMS. As far as we know, our NMS is the first DNN-free near-memory sampling technique that can effectively alleviate generalization issues and significantly reduce DDR energy caused by dataset access. The experimental results show that our NMS outperforms the current state-of-the-art (SOT A) approaches, namely DQ, DQAS, and NeSSA, in model accuracy.


The Complexity of Extreme Climate Events on the New Zealand's Kiwifruit Industry

arXiv.org Artificial Intelligence

Climate change has intensified the frequency and severity of extreme weather events, presenting unprecedented challenges to the agricultural industry worldwide. In this investigation, we focus on kiwifruit farming in New Zealand. We propose to examine the impacts of climate-induced extreme events, specifically frost, drought, extreme rainfall, and heatwave, on kiwifruit harvest yields. These four events were selected due to their significant impacts on crop productivity and their prevalence as recorded by climate monitoring institutions in the country. We employed Isolation Forest, an unsupervised anomaly detection method, to analyse climate history and recorded extreme events, alongside with kiwifruit yields. Our analysis reveals considerable variability in how different types of extreme event affect kiwifruit yields underscoring notable discrepancies between climatic extremes and individual farm's yield outcomes. Additionally, our study highlights critical limitations of current anomaly detection approaches, particularly in accurately identifying events such as frost. These findings emphasise the need for integrating supplementary features like farm management strategies with climate adaptation practices. Our further investigation will employ ensemble methods that consolidate nearby farms' yield data and regional climate station features to reduce variance, thereby enhancing the accuracy and reliability of extreme event detection and the formulation of response strategies.


EgoTrigger: Toward Audio-Driven Image Capture for Human Memory Enhancement in All-Day Energy-Efficient Smart Glasses

arXiv.org Artificial Intelligence

All-day smart glasses are likely to emerge as platforms capable of continuous contextual sensing, uniquely positioning them for unprecedented assistance in our daily lives. Integrating the multi-modal AI agents required for human memory enhancement while performing continuous sensing, however, presents a major energy efficiency challenge for all-day usage. Achieving this balance requires intelligent, context-aware sensor management. Our approach, EgoTrigger, leverages audio cues from the microphone to selectively activate power-intensive cameras, enabling efficient sensing while preserving substantial utility for human memory enhancement. EgoTrigger uses a lightweight audio model (YAMNet) and a custom classification head to trigger image capture from hand-object interaction (HOI) audio cues, such as the sound of a drawer opening or a medication bottle being opened. In addition to evaluating on the QA-Ego4D dataset, we introduce and evaluate on the Human Memory Enhancement Question-Answer (HME-QA) dataset. Our dataset contains 340 human-annotated first-person QA pairs from full-length Ego4D videos that were curated to ensure that they contained audio, focusing on HOI moments critical for contextual understanding and memory. Our results show EgoTrigger can use 54% fewer frames on average, significantly saving energy in both power-hungry sensing components (e.g., cameras) and downstream operations (e.g., wireless transmission), while achieving comparable performance on datasets for an episodic memory task. We believe this context-aware triggering strategy represents a promising direction for enabling energy-efficient, functional smart glasses capable of all-day use -- supporting applications like helping users recall where they placed their keys or information about their routine activities (e.g., taking medications).


Clustered Federated Learning for Generalizable FDIA Detection in Smart Grids with Heterogeneous Data

arXiv.org Artificial Intelligence

--False Data Injection Attacks (FDIAs) pose severe security risks to smart grids by manipulating measurement data collected from spatially distributed devices such as SCADA systems and PMUs. These measurements typically exhibit Non-Independent and Identically Distributed (Non-IID) characteristics across different regions, which significantly challenges the generalization ability of detection models. Traditional centralized training approaches not only face privacy risks and data sharing constraints but also incur high transmission costs, limiting their scalability and deployment feasibility. T o address these issues, this paper proposes a privacy-preserving federated learning framework, termed Federated Cluster A verage (FedClusA vg), designed to improve FDIA detection in Non-IID and resource-constrained environments. FedClusA vg incorporates cluster-based stratified sampling and hierarchical communication (client-subserver-server) to enhance model generalization and reduce communication overhead. By enabling localized training and weighted parameter aggregation, the algorithm achieves accurate model convergence without centralizing sensitive data. Experimental results on benchmark smart grid datasets demonstrate that FedClusA vg not only improves detection accuracy under heterogeneous data distributions but also significantly reduces communication rounds and bandwidth consumption. This work provides an effective solution for secure and efficient FDIA detection in large-scale distributed power systems. S an important cyber-physical system (CPS), smart grid is highly vulnerable to cyber attacks [1].