RatioWaveNet: A Learnable RDWT Front-End for Robust and Interpretable EEG Motor-Imagery Classification

Siino, Marco, Bonomo, Giuseppe, Sorbello, Rosario, Tinnirello, Ilenia

arXiv.org Artificial Intelligence 

Brain-computer interfaces (BCIs) based on motor imagery (MI) translate covert movement intentions into actionable commands, yet reliable decoding from non-invasive EEG remains challenging due to nonstationarity, low SNR, and subject variability. The RDWT performs an undecimated, multi-resolution subband decomposition that preserves temporal length and shift-invariance, enhancing sensorimotor rhythms while mitigating jitter and mild artifacts; subbands are fused via lightweight grouped 1-D convolutions and passed to a multi-kernel CNN for local temporal-spatial feature extraction, a grouped-query attention encoder for long-range context, and a compact TCN head for causal temporal integration. Our goal is to test whether this principled wavelet front end improves robustness precisely where BCIs typically fail--on the hardest subjects--and whether such gains persist on average across seeds under both intra-and inter-subject protocols. On BCI-IV-2a and BCI-IV-2b, across five seeds, RatioWaveNet improves worst-subject accuracy over the Transformer backbone by +0.17 / +0.42 percentage points (Sub-Dependent / LOSO) on 2a and by +1.07 / +2.54 percentage points on 2b, with consistent average-case gains These results indicate that a simple, trainable wavelet front end is an effective plug-in to strengthen Transformer-based BCIs, improving worst-case reliability without sacrificing efficiency. Introduction Brain-computer interfaces (BCIs) establish a direct communication pathway between neural activity and external devices by decoding brain signals into actionable commands, with the potential to transform both clinical rehabilitation and human-computer interactions and thereby improve quality of life [1, 2]. Among brain-sensing modalities, electroencephalography (EEG) is particularly well suited to real-world deployment because it is noninvasive, inexpensive, portable, and offers millisecond-level temporal resolution; these properties have underpinned a broad spectrum of EEG-based applications spanning cognitive skill assessment [3], driver vigilance estimation [4], emotion recognition [5], and human-robot interaction [6]. In healthcare, EEG is a key enabler of smart health ecosystems [7], supporting tasks such as automated sleep staging [8], seizure detection and monitoring [9], and neurorehabilitation after stroke [10]. Within this context, Motor Imagery (MI) - the covert rehearsal of movement without execution - has become one of the most widely adopted BCI paradigms [1].