A Penny for Your Thoughts: Decoding Speech from Inexpensive Brain Signals

Auster, Quentin, Shapovalenko, Kateryna, Ma, Chuang, Sun, Demaio

Nov-10-2025–arXiv.org Artificial Intelligence

We explore whether neural networks can decode brain activity into speech by mapping EEG recordings to audio representations. Using EEG data recorded as subjects listened to natural speech, we train a model with a contrastive CLIP loss to align EEG-derived embeddings with embeddings from a pre-trained transformer-based speech model. Building on the state-of-the-art EEG decoder from Meta, we introduce three architectural modifications: (i) subject-specific attention layers (+0.15% WER improvement), (ii) personalized spatial attention (+0.45%), and (iii) a dual-path RNN with attention (-1.87%). Two of the three modifications improved performance, highlighting the promise of personalized architectures for brain-to-speech decoding and applications in brain-computer interfaces.

machine learning, meta, natural language, (18 more...)

arXiv.org Artificial Intelligence

Nov-10-2025

arXiv.org PDF

Add feedback

Genre:
- Research Report > New Finding (0.47)

Industry:
- Health & Medicine
  - Therapeutic Area > Neurology (1.00)
  - Health Care Technology (0.86)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found