BrainStratify: Coarse-to-Fine Disentanglement of Intracranial Neural Dynamics
Zheng, Hui, Wang, Hai-Teng, Jing, Yi-Tao, Lin, Pei-Yang, Zhao, Han-Qing, Chen, Wei, Wei, Peng-Hu, Shan, Yong-Zhi, Zhao, Guo-Guang, Liu, Yun-Zhe
–arXiv.org Artificial Intelligence
Decoding speech directly from neural activity is a central goal in brain-computer interface (BCI) research. In recent years, exciting advances have been made through the growing use of intracranial field potential recordings, such as stereo-ElectroEncephaloGraphy (sEEG) and ElectroCorticoGraphy (ECoG). These neural signals capture rich population-level activity but present key challenges: (i) task-relevant neural signals are sparsely distributed across sEEG electrodes, and (ii) they are often entangled with task-irrelevant neural signals in both sEEG and ECoG. To address these challenges, we introduce a unified Coarse-to-Fine neural disentanglement framework, BrainStratify, which includes (i) identifying functional groups through spatial-context-guided temporal-spatial modeling, and (ii) disentangling distinct neural dynamics within the target functional group using Decoupled Product Quantization (DPQ). We evaluate BrainStratify on two open-source sEEG datasets and one (epidural) ECoG dataset, spanning tasks like vocal production and speech perception. Extensive experiments show that BrainStratify, as a unified framework for decoding speech from intracranial neural signals, significantly outperforms previous decoding methods. Overall, by combining data-driven stratification with neuroscience-inspired modularity, BrainStratify offers a robust and interpretable solution for speech decoding from intracranial recordings.
arXiv.org Artificial Intelligence
May-28-2025
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine
- Therapeutic Area > Neurology (1.00)
- Health Care Technology (0.88)
- Health & Medicine
- Technology: