Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion
–Neural Information Processing Systems
Decoding human vision from neural signals has attracted long-standing interest in neuroscience and machine learning. Modern contrastive learning and generative models have improved visual decoding and reconstruction based on functional magnetic resonance imaging (fMRI). However, the high cost and low temporal resolution of fMRI limit its application in brain-computer interfaces (BCIs), creating a pressing need for visual decoding based on electroencephalography (EEG). In this study, we present an end-to-end zero-shot EEG-based visual reconstruction framework consisting of a tailored brain encoder, the Adaptive Thinking Mapper (ATM), which projects neural signals from different sources into a shared subspace aligned with CLIP embeddings, and a two-stage, multi-pipe EEG-to-image generation strategy. In the first stage, EEG signals are embedded and aligned with high-level CLIP embeddings; a prior diffusion model then refines the EEG embeddings into image priors.
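The first-stage alignment described above is a CLIP-style contrastive objective between paired EEG and image embeddings. A minimal sketch of such a loss is shown below; the function name, temperature value, and tensor shapes are illustrative assumptions, not the paper's actual implementation.

```python
import torch
import torch.nn.functional as F

def clip_style_alignment_loss(eeg_emb: torch.Tensor,
                              img_emb: torch.Tensor,
                              temperature: float = 0.07) -> torch.Tensor:
    """Symmetric InfoNCE loss aligning EEG embeddings with CLIP image embeddings.

    eeg_emb, img_emb: (batch, dim) tensors from paired EEG trials and images.
    The hypothetical temperature of 0.07 follows common CLIP-style defaults.
    """
    eeg = F.normalize(eeg_emb, dim=-1)
    img = F.normalize(img_emb, dim=-1)
    # Cosine-similarity matrix; matching EEG/image pairs lie on the diagonal.
    logits = eeg @ img.t() / temperature
    targets = torch.arange(eeg.size(0), device=logits.device)
    loss_e2i = F.cross_entropy(logits, targets)      # EEG -> image direction
    loss_i2e = F.cross_entropy(logits.t(), targets)  # image -> EEG direction
    return (loss_e2i + loss_i2e) / 2
```

In practice the EEG embedding would come from the ATM encoder and the image embedding from a frozen CLIP image tower; minimizing this loss pulls matched pairs together in the shared subspace while pushing mismatched pairs apart.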
Dec-27-2025, 03:23:41 GMT