CodeBrain: Towards Decoupled Interpretability and Multi-Scale Architecture for EEG Foundation Model
Ma, Jingying, Wu, Feng, Lin, Qika, Xing, Yucheng, Liu, Chenyu, Jia, Ziyu, Feng, Mengling
–arXiv.org Artificial Intelligence
Electroencephalography (EEG) provides real-time insights into brain activity and supports diverse applications in neuroscience. While EEG foundation models (EFMs) have emerged to address the scalability issues of task-specific models, current approaches still yield clinically uninterpretable and weakly discriminative representations, inefficiently capture global dependencies, and neglect important local neural events. We present CodeBrain, a two-stage EFM designed to fill this gap. In the first stage, we introduce the TFDual-Tokenizer, which decouples heterogeneous temporal and frequency EEG signals into discrete tokens, quadratically expanding the representation space to enhance discriminative power and offering domain-specific interpretability by suggesting potential links to neural events and spectral rhythms. In the second stage, we propose the multi-scale EEGSSM architecture, which combines structured global convolution with sliding window attention to efficiently capture both sparse long-range and local dependencies, reflecting the brain's small-world topology. Pretrained on the largest public EEG corpus, CodeBrain achieves strong generalization across 8 downstream tasks and 10 datasets under distribution shifts, supported by comprehensive ablations, scaling-law analyses, and interpretability evaluations. Both code and pretraining weights will be released in the future version.
arXiv.org Artificial Intelligence
Sep-26-2025
- Country:
- Asia
- North America > United States
- Illinois > DuPage County
- Darien (0.04)
- Massachusetts (0.04)
- Illinois > DuPage County
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- Technology: