Du-IN: Discrete units-guided mask modeling for decoding speech from Intracranial Neural signals

May-31-2025, 10:54:03 GMT–Neural Information Processing Systems

Invasive brain-computer interfaces with Electrocorticography (ECoG) have shown promise for high-performance speech decoding in medical applications, but less damaging methods like intracranial stereo-electroencephalography (sEEG) remain underexplored. With rapid advances in representation learning, leveraging abundant recordings to enhance speech decoding is increasingly attractive. However, popular methods often pre-train temporal models based on brain-level tokens, overlooking that brain activities in different regions are highly desynchronized during tasks. Alternatively, they pre-train spatial-temporal models based on channel-level tokens but fail to evaluate them on challenging tasks like speech decoding, which requires intricate processing in specific language-related areas. To address this issue, we collected a well-annotated Chinese word-reading sEEG dataset targeting languagerelated brain networks from 12 subjects.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

May-31-2025, 10:54:03 GMT

Conferences PDF

Add feedback

Country:
- Asia > China (0.28)

Genre:
- Research Report > Experimental Study (0.67)

Industry:
- Health & Medicine
  - Health Care Technology (1.00)
  - Therapeutic Area > Neurology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science > Neuroscience (0.66)
  - Machine Learning > Neural Networks (0.67)
  - Natural Language (1.00)
  - Representation & Reasoning (0.66)