Goto

Collaborating Authors

 elsevier


I-GLIDE: Input Groups for Latent Health Indicators in Degradation Estimation

Thil, Lucas, Read, Jesse, Kaddah, Rim, Doquet, Guillaume

arXiv.org Artificial Intelligence

Accurate remaining useful life (RUL) prediction hinges on the quality of health indicators (HIs), yet existing methods often fail to disentangle complex degradation mechanisms in multi-sensor systems or quantify uncertainty in HI reliability. This paper introduces a novel framework for HI construction, advancing three key contributions. First, we adapt Reconstruction along Projected Pathways (RaPP) as a health indicator (HI) for RUL prediction for the first time, showing that it outperforms traditional reconstruction error metrics. Second, we show that augmenting RaPP-derived HIs with aleatoric and epistemic uncertainty quantification (UQ)--via Monte Carlo dropout and probabilistic latent spaces-- significantly improves RUL-prediction robustness. Third, and most critically, we propose indicator groups, a paradigm that isolates sensor subsets to model system-specific degradations, giving rise to our novel method, I-GLIDE which enables interpretable, mechanism-specific diagnostics. Evaluated on data sourced from aerospace and manufacturing systems, our approach achieves marked improvements in accuracy and generalizability compared to state-of-the-art HI methods while providing actionable insights into system failure pathways.


An Evaluation Framework for Network IDS/IPS Datasets: Leveraging MITRE ATT&CK and Industry Relevance Metrics

Tori, Adrita Rahman, Hasan, Khondokar Fida

arXiv.org Artificial Intelligence

The performance of Machine Learning (ML) and Deep Learning (DL)-based Intrusion Detection and Prevention Systems (IDS/IPS) is critically dependent on the relevance and quality of the datasets used for training and evaluation. However, current AI model evaluation practices for developing IDS/IPS focus predominantly on accuracy metrics, often overlooking whether datasets represent industry-specific threats. To address this gap, we introduce a novel multi-dimensional framework that integrates the MITRE ATT&CK knowledge base for threat intelligence and employs five complementary metrics that together provide a comprehensive assessment of dataset suitability. Methodologically, this framework combines threat intelligence, natural language processing, and quantitative analysis to assess the suitability of datasets for specific industry contexts. Applying this framework to nine publicly available IDS/IPS datasets reveals significant gaps in threat coverage, particularly in the healthcare, energy, and financial sectors. In particular, recent datasets (e.g., CIC-IoMT, CIC-UNSW-NB15) align better with sector-specific threats, whereas others, like CICIoV-24, underperform despite their recency. Our findings provide a standardized, interpretable approach for selecting datasets aligned with sector-specific operational requirements, ultimately enhancing the real-world effectiveness of AI-driven IDS/IPS deployments. The efficiency and practicality of the framework are validated through deployment in a real-world case study, underscoring its capacity to inform dataset selection and enhance the effectiveness of AI-driven IDS/IPS in operational environments.



Coupling Agent-based Modeling and Life Cycle Assessment to Analyze Trade-offs in Resilient Energy Transitions

Zhang, Beichen, Zaki, Mohammed T., Breunig, Hanna, Ajami, Newsha K.

arXiv.org Artificial Intelligence

Transitioning to sustainable and resilient energy systems requires navigating complex and interdependent trade-offs across environmental, social, and resource dimensions. Neglecting these trade-offs can lead to unintended consequences across sectors. However, existing assessments often evaluate emerging energy pathways and their impacts in silos, overlooking critical interactions such as regional resource competition and cumulative impacts. We present an integrated modeling framework that couples agent-based modeling and Life Cycle Assessment (LCA) to simulate how energy transition pathways interact with regional resource competition, ecological constraints, and community-level burdens. We apply the model to a case study in Southern California. The results demonstrate how integrated and multiscale decision making can shape energy pathway deployment and reveal spatially explicit trade-offs under scenario-driven constraints. This modeling framework can further support more adaptive and resilient energy transition planning on spatial and institutional scales.


Physiologically Active Vegetation Reverses Its Cooling Effect in Humid Urban Climates

Borah, Angana, Datta, Adrija, Kumar, Ashish S., Dave, Raviraj, Bhatia, Udit

arXiv.org Artificial Intelligence

Efforts to green cities for cooling are succeeding unevenly because the same vegetation that cools surfaces can also intensify how hot the air feels. Previous studies have identified humid heat as a growing urban hazard, yet how physiologically active vegetation governs this trade-off between cooling and moisture accumulation remains poorly understood, leaving mitigation policy and design largely unguided. Here we quantify how vegetation structure and function influence the Heat Index (HI), a combined measure of temperature and humidity in 138 Indian cities spanning tropical savanna, semi-arid steppe, and humid subtropical climates, and across dense urban cores and semi-urban rings. Using an extreme-aware, one kilometre reconstruction of HI and an interpretable machine-learning framework that integrates SHapley Additive Explanations (SHAP) and Accumulated Local Effects (ALE), we isolate vegetation-climate interactions. Cooling generally strengthens for EVI >= 0.4 and LAI >= 0.05, but joint-high regimes begin to reverse toward warming when EVI >= 0.5, LAI >= 0.2, and fPAR >= 0.5,with an earlier onset for fPAR >= 0.25 in humid, dense cores. In such environments, highly physiologically active vegetation elevates near-surface humidity faster than it removes heat, reversing its cooling effect and amplifying perceived heat stress. These findings establish the climatic limits of vegetation-driven cooling and provide quantitative thresholds for climate-specific greening strategies that promote equitable and heat-resilient cities.


Potential Indicator for Continuous Emotion Arousal by Dynamic Neural Synchrony

Pan, Guandong, Wu, Zhaobang, Yang, Yaqian, Wang, Xin, Liu, Longzhao, Zheng, Zhiming, Tang, Shaoting

arXiv.org Artificial Intelligence

The need for automatic and high-quality emotion annotation is paramount in applications such as continuous emotion rec ognition and video highlight detection, yet achieving this through manu al human annotations is challenging. Inspired by inter-subject corre lation (ISC) utilized in neuroscience, this study introduces a novel Electr oencephalog-raphy (EEG) based ISC methodology that leverages a single-e lectrode and feature-based dynamic approach. Our contributions are three folds: Firstly, we reidentify two potent emotion features suitabl e for classifying emotions--first-order difference (FD) an differential entrop y (DE). Secondly, through the use of overall correlation analysis, we d emonstrate the heterogeneous synchronized performance of electrodes. Th is performance aligns with neural emotion patterns established in prior st udies, thus validating the effectiveness of our approach. Thirdly, by emplo ying a sliding window correlation technique, we showcase the significant c onsistency of dynamic ISCs across various features or key electrodes in ea ch analyzed film clip. Our findings indicate the method's reliability in c apturing consistent, dynamic shared neural synchrony among individual s, triggered by evocative film stimuli. This underscores the potential of our approach to serve as an indicator of continuous human emotion arousal . The implications of this research are significant for advancement s in affective computing and the broader neuroscience field, suggesting a s treamlined and effective tool for emotion analysis in real-world applic ations. 2 G. Pan et al.


PIPES: A Meta-dataset of Machine Learning Pipelines

Maia, Cynthia Moreira, de Amorim, Lucas B. V., Cavalcanti, George D. C., Cruz, Rafael M. O.

arXiv.org Artificial Intelligence

Solutions to the Algorithm Selection Problem (ASP) in machine learning face the challenge of high computational costs associated with evaluating various algorithms' performances on a given dataset. To mitigate this cost, the meta-learning field can leverage previously executed experiments shared in online repositories such as OpenML. OpenML provides an extensive collection of machine learning experiments. However, an analysis of OpenML's records reveals limitations. It lacks diversity in pipelines, specifically when exploring data preprocessing steps/blocks, such as scaling or imputation, resulting in limited representation. Its experiments are often focused on a few popular techniques within each pipeline block, leading to an imbalanced sample. To overcome the observed limitations of OpenML, we propose PIPES, a collection of experiments involving multiple pipelines designed to represent all combinations of the selected sets of techniques, aiming at diversity and completeness. PIPES stores the results of experiments performed applying 9,408 pipelines to 300 datasets. It includes detailed information on the pipeline blocks, training and testing times, predictions, performances, and the eventual error messages. This comprehensive collection of results allows researchers to perform analyses across diverse and representative pipelines and datasets. PIPES also offers potential for expansion, as additional data and experiments can be incorporated to support the meta-learning community further. The data, code, supplementary material, and all experiments can be found at https://github.com/cynthiamaia/PIPES.git.


A Structured Review of Underwater Object Detection Challenges and Solutions: From Traditional to Large Vision Language Models

Nabahirwa, Edwine, Song, Wei, Zhang, Minghua, Fang, Yi, Ni, Zhou

arXiv.org Artificial Intelligence

Despite its significance, the underwater world remains largely overlooked as a result of the challenging conditions that hinder traditional research methods. Historically, the study of marine ecosystems relied on labor intensive research [1], which provided limited data and had a high error margin. In recent years, advances in autonomous and remotely operated vehicles (AUVs and ROVs) have revolutionized underwater exploration. These technologies, equipped with object detection systems, now allow real-time monitoring, which includes capturing images of marine organisms, environmental conditions, and even assessing biodiversity [2], [3]. However, the quality of images and videos captured underwater remains a significant obstacle. Light absorption, scattering, and water-related distortions, such as haze and color shifts [4], create noisy low-contrast images, further compounded by complex underwater backgrounds and camera motion. These challenges call for advanced detection techniques capable of accurately identifying and localizing objects despite underwater noise. Efficient underwater object detection (UOD) is crucial for a variety of marine applications, including biodiversity monitoring, conservation efforts, and resource management.


Evaluating Contrast Localizer for Identifying Causal Units in Social & Mathematical Tasks in Language Models

Jamaa, Yassine, AlKhamissi, Badr, Ghosh, Satrajit, Schrimpf, Martin

arXiv.org Artificial Intelligence

This work adapts a neuroscientific contrast localizer to pinpoint causally relevant units for Theory of Mind (ToM) and mathematical reasoning tasks in large language models (LLMs) and vision-language models (VLMs). Across 11 LLMs and 5 VLMs ranging in size from 3B to 90B parameters, we localize top-activated units using contrastive stimulus sets and assess their causal role via targeted ablations. We compare the effect of lesioning functionally selected units against low-activation and randomly selected units on downstream accuracy across established ToM and mathematical benchmarks. Contrary to expectations, low-activation units sometimes produced larger performance drops than the highly activated ones, and units derived from the mathematical localizer often impaired ToM performance more than those from the ToM localizer. These findings call into question the causal relevance of contrast-based localizers and highlight the need for broader stimulus sets and more accurately capture task-specific units.


Physics-Based Explainable AI for ECG Segmentation: A Lightweight Model

Sidiq, Muhammad Fathur Rohman, Abdurrouf, null, Santoso, Didik Rahadi

arXiv.org Artificial Intelligence

Physics - Based Explainable AI for ECG Segmentation: A Lightweight Model Muhammad Fathur Rohman Sidiq Department of Physics, Faculty of Mathematics and Science, Brawijaya University, Malang, Indonesia Abdurrouf Department of Physics, Faculty of Mathematics and Science, Brawijaya University, Malang, Indonesia Didik Rahadi Santoso * Department of Physics, Faculty of Mathematics and Science, Brawijaya University, Malang, Indonesia * Corresponding author. E - mail: dieks@ub.ac.id Abstract The heart's electrical activity, recorded through Electrocardiography (ECG), is essential for diagnosing various cardiovascular conditions. However, many existing ECG segmentation models rely on complex, multi - layered architectures such as BiLSTM, which ar e computationally intensive and inefficient. This study introduces a streamlined architecture that combines spectral analysis with probabilistic predictions for ECG signal segmentation. Additionally, an Explainable AI (XAI) approach is applied to enhance model interpretability by explaining how temporal and frequency - based features contribute to ECG segmentation. By i ncorporating principles from physics - based AI, this method provides a clear understanding of the decision - making process, ensuring reliability and transparency in ECG analysis.