AITopics | ribeiro

Collaborating Authors

ribeiro

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ResilientConstrainedLearning

Neural Information Processing SystemsFeb-17-2026, 15:19:09 GMT

When deploying machine learning solutions, they must satisfy multiple requirementsbeyondaccuracy,suchasfairness,robustness,orsafety.

artificial intelligence, machine learning, p-res, (18 more...)

Neural Information Processing Systems

Country:

Europe > Belgium > Flanders > East Flanders > Ghent (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Education (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Stability of Graph Scattering Transforms

Fernando Gama, Alejandro Ribeiro, Joan Bruna

Neural Information Processing SystemsFeb-11-2026, 23:27:47 GMT

Whenconsidering networkdata,regularconvolutionsdonot hold since the data domain presents an irregular structure given by the network topology.

artificial intelligence, convolution, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.04)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Communications > Networks (0.49)

Add feedback

ProbablyApproximatelyCorrectConstrainedLearning

Neural Information Processing SystemsFeb-10-2026, 04:05:48 GMT

As these systems become ubiquitous, so does the need tocurtail their behavior.

artificial intelligence, machine learning, optimization problem, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

EcoScaleNet: A Lightweight Multi Kernel Network for Long Sequence 12 lead ECG Classification

Kang, Dong-Hyeon, Nam, Ju-Hyeon, Lee, Sang-Chul

arXiv.org Artificial IntelligenceOct-30-2025

Accurate interpretation of 12-lead electrocardiograms (ECGs) is critical for early detection of cardiac abnormalities, yet manual reading is error-prone and existing CNN-based classifiers struggle to choose receptive-field sizes that generalize to the long sequences typical of ECGs. Omni-Scale CNN (OS-CNN) addresses this by enumerating prime-sized kernels inspired by Goldbach's conjecture to cover every scale, but its exhaustive design explodes computational cost and blocks deeper, wider models. We present E fficient C onvolutional O mni-Scale Net work (EcoScale-Net), a hierarchical variant that retains full receptive-field coverage while eliminating redundancy. At each stage, the maximum kernel length is capped to the scale still required after down-sampling, and 1 1 bottleneck convolutions inserted before and after every Omni-Scale block curtail channel growth and fuse multi-scale features. On the large-scale CODE-15% ECG dataset, EcoScale-Net reduces parameters by 90% and FLOPs by 99% compared with OS-CNN, while raising macro-averaged F1-score by 2.4%. These results demonstrate that EcoScale-Net delivers state-of-the-art accuracy for long-sequence ECG classification at a fraction of the computational cost, enabling real-time deployment on commodity hardware. Our EcoScale-Net code is available in GitHub Link.

artificial intelligence, ecoscale-net, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2510.24748

Genre: Research Report (0.84)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

Explaining deep learning for ECG using time-localized clusters

Boubekki, Ahcène, Patlatzoglou, Konstantinos, Barker, Joseph, Ng, Fu Siong, Ribeiro, Antônio H.

arXiv.org Machine LearningSep-19-2025

Deep learning has significantly advanced electrocardiogram (ECG) analysis, enabling automatic annotation, disease screening, and prognosis beyond traditional clinical capabilities. However, understanding these models remains a challenge, limiting interpretation and gaining knowledge from these developments. In this work, we propose a novel interpretability method for convolutional neural networks applied to ECG analysis. Our approach extracts time-localized clusters from the model's internal representations, segmenting the ECG according to the learned characteristics while quantifying the uncertainty of these representations. This allows us to visualize how different waveform regions contribute to the model's predictions and assess the certainty of its decisions. By providing a structured and interpretable view of deep learning models for ECG, our method enhances trust in AI-driven diagnostics and facilitates the discovery of clinically relevant electrophysiological patterns.

ecg, explanation, representation, (16 more...)

arXiv.org Machine Learning

2509.15198

Country:

Europe > Sweden > Uppsala County > Uppsala (0.04)
South America > Brazil > Minas Gerais (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(4 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

SMOGAN: Synthetic Minority Oversampling with GAN Refinement for Imbalanced Regression

Alahyari, Shayan, Domaratzki, Mike

arXiv.org Artificial IntelligenceAug-11-2025

Imbalanced regression refers to prediction tasks where the target variable is skewed. This skewness hinders machine learning models, especially neural networks, which concentrate on dense regions and therefore perform poorly on underrepresented (minority) samples. Despite the importance of this problem, only a few methods have been proposed for imbalanced regression. Many of the available solutions for imbalanced regression adapt techniques from the class imbalance domain, such as linear interpolation and the addition of Gaussian noise, to create synthetic data in sparse regions. However, in many cases, the underlying distribution of the data is complex and non-linear. Consequently, these approaches generate synthetic samples that do not accurately represent the true feature-target relationship. To overcome these limitations, we propose SMOGAN, a two-step oversampling framework for imbalanced regression. In Stage 1, an existing oversampler generates initial synthetic samples in sparse target regions. In Stage 2, we introduce DistGAN, a distribution-aware GAN that serves as SMOGAN's filtering layer and refines these samples via adversarial loss augmented with a Maximum Mean Discrepancy objective, aligning them with the true joint feature-target distribution. Extensive experiments on 23 imbalanced datasets show that SMOGAN consistently outperforms the default oversampling method without the DistGAN filtering layer.

artificial intelligence, machine learning, regression, (15 more...)

arXiv.org Artificial Intelligence

2504.21152

Country: North America > Canada > Ontario (0.14)

Genre: Research Report (0.82)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Regression Augmentation With Data-Driven Segmentation

Alahyari, Shayan, Ghobadlou, Shiva Mehdipour, Domaratzki, Mike

arXiv.org Artificial IntelligenceAug-5-2025

Imbalanced regression arises when the target distribution is skewed, causing models to focus on dense regions and struggle with underrepresented (minority) samples. Despite its relevance across many applications, few methods have been designed specifically for this challenge. Existing approaches often rely on fixed, ad hoc thresholds to label samples as rare or common, overlooking the continuous complexity of the joint feature-target space and fail to represent the true underlying rare regions. To address these limitations, we propose a fully data-driven GAN-based augmentation framework that uses Mahalanobis-Gaussian Mixture Modeling (GMM) to automatically identify minority samples and employs deterministic nearest-neighbour matching to enrich sparse regions. Rather than preset thresholds, our method lets the data determine which observations are truly rare. Evaluation on 32 benchmark imbalanced regression datasets demonstrates that our approach consistently outperforms state-of-the-art data augmentation methods.

data mining, machine learning, regression, (16 more...)

arXiv.org Artificial Intelligence

2508.01455

Country: North America > Canada > Ontario (0.14)

Genre:

Research Report (0.82)
Overview (0.68)

Industry: Materials (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Graph Semi-Supervised Learning for Point Classification on Data Manifolds

Netto, Caio F. Deberaldini, Wang, Zhiyang, Ruiz, Luana

arXiv.org Machine LearningJun-17-2025

We propose a graph semi-supervised learning framework for classification tasks on data manifolds. Motivated by the manifold hypothesis, we model data as points sampled from a low-dimensional manifold $\mathcal{M} \subset \mathbb{R}^F$. The manifold is approximated in an unsupervised manner using a variational autoencoder (VAE), where the trained encoder maps data to embeddings that represent their coordinates in $\mathbb{R}^F$. A geometric graph is constructed with Gaussian-weighted edges inversely proportional to distances in the embedding space, transforming the point classification problem into a semi-supervised node classification task on the graph. This task is solved using a graph neural network (GNN). Our main contribution is a theoretical analysis of the statistical generalization properties of this data-to-manifold-to-graph pipeline. We show that, under uniform sampling from $\mathcal{M}$, the generalization gap of the semi-supervised task diminishes with increasing graph size, up to the GNN training error. Leveraging a training procedure which resamples a slightly larger graph at regular intervals during training, we then show that the generalization gap can be reduced even further, vanishing asymptotically. Finally, we validate our findings with numerical experiments on image classification benchmarks, demonstrating the empirical effectiveness of our approach.

artificial intelligence, graph, machine learning, (16 more...)

arXiv.org Machine Learning

2506.12197

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Maryland > Baltimore (0.04)
North America > Canada > British Columbia > Vancouver (0.04)
(11 more...)

Genre: Research Report > New Finding (0.66)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

CART-based Synthetic Tabular Data Generation for Imbalanced Regression

Pinheiro, António Pedro, Ribeiro, Rita P.

arXiv.org Artificial IntelligenceJun-4-2025

Handling imbalanced target distributions in regression tasks remains a significant challenge in tabular data settings where underrep-resented regions can hinder model performance. Among data-level solutions, some proposals, such as random sampling and SMOTE-based approaches, propose adapting classification techniques to regression tasks. However, these methods typically rely on crisp, artificial thresholds over the target variable, a limitation inherited from classification settings that can introduce arbitrariness, often leading to non-intuitive and potentially misleading problem formulations. While recent generative models, such as GANs and VAEs, provide flexible sample synthesis, they come with high computational costs and limited interpretability. In this study, we propose adapting an existing CART-based synthetic data generation method, tailoring it for imbalanced regression. The new method integrates relevance and density-based mechanisms to guide sampling in sparse regions of the target space and employs a threshold-free, feature-driven generation process. Our experimental study focuses on the prediction of extreme target values across benchmark datasets. The results indicate that the proposed method is competitive with other resampling and generative strategies in terms of performance, while offering faster execution and greater transparency.

artificial intelligence, machine learning, regression, (15 more...)

arXiv.org Artificial Intelligence

2506.02811

Country: Europe > Portugal (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Interpretable Rules for Online Failure Prediction: A Case Study on the Metro do Porto dataset

Jakobs, Matthias, Veloso, Bruno, Gama, Joao

arXiv.org Artificial IntelligenceFeb-11-2025

Due to their high predictive performance, predictive maintenance applications have increasingly been approached with Deep Learning techniques in recent years. However, as in other real-world application scenarios, the need for explainability is often stated but not sufficiently addressed. This study will focus on predicting failures on Metro trains in Porto, Portugal. While recent works have found high-performing deep neural network architectures that feature a parallel explainability pipeline, the generated explanations are fairly complicated and need help explaining why the failures are happening. This work proposes a simple online rule-based explainability approach with interpretable features that leads to straightforward, interpretable rules. We showcase our approach on MetroPT2 and find that three specific sensors on the Metro do Porto trains suffice to predict the failures present in the dataset with simple rules. The most straightforward approach, corrective maintenance, merely replaces machine parts whenever they break.

artificial intelligence, deep learning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2502.07394

Country:

Europe > Portugal > Porto > Porto (0.25)
North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Transportation (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback