Chen, Shengyong
Prototype-based Heterogeneous Federated Learning for Blade Icing Detection in Wind Turbines with Class Imbalanced Data
Qi, Lele, Liu, Mengna, Cheng, Xu, Shi, Fan, Liu, Xiufeng, Chen, Shengyong
N effective strategy to reduce carbon emissions is to replace traditional fossil fuels by developing clean renewable Traditional federated learning (FL) offers an effective solution energy sources. Among renewable energy sources, wind to data privacy disclosure issue in centralized data-driven energy stands out as one of the most significant, alongside methods. Under the FL framework, each turbine contributes hydropower [1]. Therefore, the efficient operation of wind its own data to jointly train a global model without direct turbines is crucial to maximize energy output. To optimize data exchange [10]. This collaborative learning method avoids the harnessing of wind energy, wind farms are commonly centralized data storage and protects the privacy and security established on ridges, mountaintops, or other elevated areas. of data. FL has already been first applied to detect blade icing The low-temperature climate in these areas can lead to blade in wind turbines using a heterogeneous framework [11].
PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning
Pan, Jiancheng, Ma, Muyuan, Ma, Qing, Bai, Cong, Chen, Shengyong
Remote sensing image-text retrieval constitutes a foundational aspect of remote sensing interpretation tasks, facilitating the alignment of vision and language representations. This paper introduces a prior instruction representation (PIR) learning paradigm that draws on prior knowledge to instruct adaptive learning of vision and text representations. Based on PIR, a domain-adapted remote sensing image-text retrieval framework PIR-ITR is designed to address semantic noise issues in vision-language understanding tasks. However, with massive additional data for pre-training the vision-language foundation model, remote sensing image-text retrieval is further developed into an open-domain retrieval task. Continuing with the above, we propose PIR-CLIP, a domain-specific CLIP-based framework for remote sensing image-text retrieval, to address semantic noise in remote sensing vision-language representations and further improve open-domain retrieval performance. In vision representation, Vision Instruction Representation (VIR) based on Spatial-PAE utilizes the prior-guided knowledge of the remote sensing scene recognition by building a belief matrix to select key features for reducing the impact of semantic noise. In text representation, Language Cycle Attention (LCA) based on Temporal-PAE uses the previous time step to cyclically activate the current time step to enhance text representation capability. A cluster-wise Affiliation Loss (AL) is proposed to constrain the inter-classes and to reduce the semantic confusion zones in the common subspace. Comprehensive experiments demonstrate that PIR could enhance vision and text representations and outperform the state-of-the-art methods of closed-domain and open-domain retrieval on two benchmark datasets, RSICD and RSITMD.
Disentangling Imperfect: A Wavelet-Infused Multilevel Heterogeneous Network for Human Activity Recognition in Flawed Wearable Sensor Data
Liu, Mengna, Xiang, Dong, Cheng, Xu, Liu, Xiufeng, Zhang, Dalin, Chen, Shengyong, Jensen, Christian S.
The popularity and diffusion of wearable devices provides new opportunities for sensor-based human activity recognition that leverages deep learning-based algorithms. Although impressive advances have been made, two major challenges remain. First, sensor data is often incomplete or noisy due to sensor placement and other issues as well as data transmission failure, calling for imputation of missing values, which also introduces noise. Second, human activity has multi-scale characteristics. Thus, different groups of people and even the same person may behave differently under different circumstances. To address these challenges, we propose a multilevel heterogeneous neural network, called MHNN, for sensor data analysis. We utilize multilevel discrete wavelet decomposition to extract multi-resolution features from sensor data. This enables distinguishing signals with different frequencies, thereby suppressing noise. As the components resulting from the decomposition are heterogeneous, we equip the proposed model with heterogeneous feature extractors that enable the learning of multi-scale features. Due to the complementarity of these features, we also include a cross aggregation module for enhancing their interactions. An experimental study using seven publicly available datasets offers evidence that MHNN can outperform other cutting-edge models and offers evidence of robustness to missing values and noise. An ablation study confirms the importance of each module.