Du-IN: Discrete units-guided mask modeling for decoding speech from Intracranial Neural signals
–Neural Information Processing Systems
Invasive brain-computer interfaces with Electrocorticography (ECoG) have shown promise for high-performance speech decoding in medical applications, but less damaging methods like intracranial stereo-electroencephalography (sEEG) remain underexplored. With rapid advances in representation learning, leveraging abundant recordings to enhance speech decoding is increasingly attractive. However, popular methods often pre-train temporal models based on brain-level tokens, overlooking that brain activities in different regions are highly desynchronized during tasks. Alternatively, they pre-train spatial-temporal models based on channel-level tokens but fail to evaluate them on challenging tasks like speech decoding, which requires intricate processing in specific language-related areas. To address this issue, we collected a well-annotated Chinese word-reading sEEG dataset targeting languagerelated brain networks from 12 subjects.
Neural Information Processing Systems
May-31-2025, 10:54:03 GMT
- Genre:
- Research Report > Experimental Study (0.67)
- Industry:
- Health & Medicine
- Health Care Technology (1.00)
- Therapeutic Area > Neurology (1.00)
- Health & Medicine
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Neuroscience (0.66)
- Machine Learning > Neural Networks (0.67)
- Natural Language (1.00)
- Representation & Reasoning (0.66)
- Information Technology > Artificial Intelligence