ribeiro
EcoScaleNet: A Lightweight Multi Kernel Network for Long Sequence 12 lead ECG Classification
Kang, Dong-Hyeon, Nam, Ju-Hyeon, Lee, Sang-Chul
Accurate interpretation of 12-lead electrocardiograms (ECGs) is critical for early detection of cardiac abnormalities, yet manual reading is error-prone and existing CNN-based classifiers struggle to choose receptive-field sizes that generalize to the long sequences typical of ECGs. Omni-Scale CNN (OS-CNN) addresses this by enumerating prime-sized kernels inspired by Goldbach's conjecture to cover every scale, but its exhaustive design explodes computational cost and blocks deeper, wider models. We present E fficient C onvolutional O mni-Scale Net work (EcoScale-Net), a hierarchical variant that retains full receptive-field coverage while eliminating redundancy. At each stage, the maximum kernel length is capped to the scale still required after down-sampling, and 1 1 bottleneck convolutions inserted before and after every Omni-Scale block curtail channel growth and fuse multi-scale features. On the large-scale CODE-15% ECG dataset, EcoScale-Net reduces parameters by 90% and FLOPs by 99% compared with OS-CNN, while raising macro-averaged F1-score by 2.4%. These results demonstrate that EcoScale-Net delivers state-of-the-art accuracy for long-sequence ECG classification at a fraction of the computational cost, enabling real-time deployment on commodity hardware. Our EcoScale-Net code is available in GitHub Link.
Explaining deep learning for ECG using time-localized clusters
Boubekki, Ahcène, Patlatzoglou, Konstantinos, Barker, Joseph, Ng, Fu Siong, Ribeiro, Antônio H.
Deep learning has significantly advanced electrocardiogram (ECG) analysis, enabling automatic annotation, disease screening, and prognosis beyond traditional clinical capabilities. However, understanding these models remains a challenge, limiting interpretation and gaining knowledge from these developments. In this work, we propose a novel interpretability method for convolutional neural networks applied to ECG analysis. Our approach extracts time-localized clusters from the model's internal representations, segmenting the ECG according to the learned characteristics while quantifying the uncertainty of these representations. This allows us to visualize how different waveform regions contribute to the model's predictions and assess the certainty of its decisions. By providing a structured and interpretable view of deep learning models for ECG, our method enhances trust in AI-driven diagnostics and facilitates the discovery of clinically relevant electrophysiological patterns.
SMOGAN: Synthetic Minority Oversampling with GAN Refinement for Imbalanced Regression
Alahyari, Shayan, Domaratzki, Mike
Imbalanced regression refers to prediction tasks where the target variable is skewed. This skewness hinders machine learning models, especially neural networks, which concentrate on dense regions and therefore perform poorly on underrepresented (minority) samples. Despite the importance of this problem, only a few methods have been proposed for imbalanced regression. Many of the available solutions for imbalanced regression adapt techniques from the class imbalance domain, such as linear interpolation and the addition of Gaussian noise, to create synthetic data in sparse regions. However, in many cases, the underlying distribution of the data is complex and non-linear. Consequently, these approaches generate synthetic samples that do not accurately represent the true feature-target relationship. To overcome these limitations, we propose SMOGAN, a two-step oversampling framework for imbalanced regression. In Stage 1, an existing oversampler generates initial synthetic samples in sparse target regions. In Stage 2, we introduce DistGAN, a distribution-aware GAN that serves as SMOGAN's filtering layer and refines these samples via adversarial loss augmented with a Maximum Mean Discrepancy objective, aligning them with the true joint feature-target distribution. Extensive experiments on 23 imbalanced datasets show that SMOGAN consistently outperforms the default oversampling method without the DistGAN filtering layer.
Regression Augmentation With Data-Driven Segmentation
Alahyari, Shayan, Ghobadlou, Shiva Mehdipour, Domaratzki, Mike
Imbalanced regression arises when the target distribution is skewed, causing models to focus on dense regions and struggle with underrepresented (minority) samples. Despite its relevance across many applications, few methods have been designed specifically for this challenge. Existing approaches often rely on fixed, ad hoc thresholds to label samples as rare or common, overlooking the continuous complexity of the joint feature-target space and fail to represent the true underlying rare regions. To address these limitations, we propose a fully data-driven GAN-based augmentation framework that uses Mahalanobis-Gaussian Mixture Modeling (GMM) to automatically identify minority samples and employs deterministic nearest-neighbour matching to enrich sparse regions. Rather than preset thresholds, our method lets the data determine which observations are truly rare. Evaluation on 32 benchmark imbalanced regression datasets demonstrates that our approach consistently outperforms state-of-the-art data augmentation methods.
Graph Semi-Supervised Learning for Point Classification on Data Manifolds
Netto, Caio F. Deberaldini, Wang, Zhiyang, Ruiz, Luana
We propose a graph semi-supervised learning framework for classification tasks on data manifolds. Motivated by the manifold hypothesis, we model data as points sampled from a low-dimensional manifold $\mathcal{M} \subset \mathbb{R}^F$. The manifold is approximated in an unsupervised manner using a variational autoencoder (VAE), where the trained encoder maps data to embeddings that represent their coordinates in $\mathbb{R}^F$. A geometric graph is constructed with Gaussian-weighted edges inversely proportional to distances in the embedding space, transforming the point classification problem into a semi-supervised node classification task on the graph. This task is solved using a graph neural network (GNN). Our main contribution is a theoretical analysis of the statistical generalization properties of this data-to-manifold-to-graph pipeline. We show that, under uniform sampling from $\mathcal{M}$, the generalization gap of the semi-supervised task diminishes with increasing graph size, up to the GNN training error. Leveraging a training procedure which resamples a slightly larger graph at regular intervals during training, we then show that the generalization gap can be reduced even further, vanishing asymptotically. Finally, we validate our findings with numerical experiments on image classification benchmarks, demonstrating the empirical effectiveness of our approach.
CART-based Synthetic Tabular Data Generation for Imbalanced Regression
Pinheiro, António Pedro, Ribeiro, Rita P.
Handling imbalanced target distributions in regression tasks remains a significant challenge in tabular data settings where underrep-resented regions can hinder model performance. Among data-level solutions, some proposals, such as random sampling and SMOTE-based approaches, propose adapting classification techniques to regression tasks. However, these methods typically rely on crisp, artificial thresholds over the target variable, a limitation inherited from classification settings that can introduce arbitrariness, often leading to non-intuitive and potentially misleading problem formulations. While recent generative models, such as GANs and VAEs, provide flexible sample synthesis, they come with high computational costs and limited interpretability. In this study, we propose adapting an existing CART-based synthetic data generation method, tailoring it for imbalanced regression. The new method integrates relevance and density-based mechanisms to guide sampling in sparse regions of the target space and employs a threshold-free, feature-driven generation process. Our experimental study focuses on the prediction of extreme target values across benchmark datasets. The results indicate that the proposed method is competitive with other resampling and generative strategies in terms of performance, while offering faster execution and greater transparency.
Interpretable Rules for Online Failure Prediction: A Case Study on the Metro do Porto dataset
Jakobs, Matthias, Veloso, Bruno, Gama, Joao
Due to their high predictive performance, predictive maintenance applications have increasingly been approached with Deep Learning techniques in recent years. However, as in other real-world application scenarios, the need for explainability is often stated but not sufficiently addressed. This study will focus on predicting failures on Metro trains in Porto, Portugal. While recent works have found high-performing deep neural network architectures that feature a parallel explainability pipeline, the generated explanations are fairly complicated and need help explaining why the failures are happening. This work proposes a simple online rule-based explainability approach with interpretable features that leads to straightforward, interpretable rules. We showcase our approach on MetroPT2 and find that three specific sensors on the Metro do Porto trains suffice to predict the failures present in the dataset with simple rules. The most straightforward approach, corrective maintenance, merely replaces machine parts whenever they break.