Brain Harmony: A Multimodal Foundation Model Unifying Morphology and Function into 1D Tokens
Neural Information Processing Systems
BrainHarmonix was pretrained on two of the largest neuroimaging datasets to date, encompassing 64,594 T1-weighted structural MRI 3D volumes (14 million images) and 70,933 functional MRI (fMRI) time series. It is grounded in two foundational neuroscience principles: (1) structural and functional modalities offer distinct yet synergistic insights into brain organization; (2) brain functional dynamics are shaped by cortical morphology. The modular pretraining process involves single-modality training with geometric pre-alignment, followed by modality fusion through shared brain hub tokens. Notably, our dynamics encoder uniquely handles fMRI time series with heterogeneous repetition times (TRs), addressing a major limitation of existing models. BrainHarmonix is also the first to deeply compress high-dimensional neuroimaging signals into unified, continuous 1D tokens, forming a compact latent space of the human brain. It generalizes strongly across diverse downstream tasks, including neurodevelopmental and neurodegenerative disorder classification and cognition prediction, consistently outperforming previous approaches. Our models, pretrained on 8 H100 GPUs, aim to catalyze a new era of AI-driven neuroscience powered by large-scale multimodal neuroimaging.
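The abstract does not specify how the shared hub tokens fuse the two modalities. As a rough illustration only, the sketch below assumes a single-head cross-attention with no learned projections, in which a fixed set of hub tokens attends over the concatenated structural and functional tokens. The function `fuse_with_hub_tokens`, the token counts, and the embedding size are all hypothetical and are not taken from the paper. The sketch shows one property suggested by the abstract: the fused output is a short 1D token sequence whose length does not depend on how many input tokens each scan produces.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def fuse_with_hub_tokens(hub_tokens, struct_tokens, func_tokens):
    """Hypothetical fusion step: each hub token attends over all modality tokens.

    Assumption: plain scaled dot-product attention without learned weights.
    Returns one fused vector per hub token.
    """
    context = struct_tokens + func_tokens  # concatenate the two modalities
    dim = len(hub_tokens[0])
    fused = []
    for hub in hub_tokens:
        scores = [dot(hub, tok) / math.sqrt(dim) for tok in context]
        weights = softmax(scores)
        fused.append(
            [sum(w * tok[d] for w, tok in zip(weights, context)) for d in range(dim)]
        )
    return fused


def random_tokens(rng, count, dim):
    """Stand-in for encoder outputs; real tokens would come from trained encoders."""
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(count)]


rng = random.Random(0)
DIM = 8        # illustrative embedding size
NUM_HUBS = 4   # illustrative number of hub tokens

hub = random_tokens(rng, NUM_HUBS, DIM)
structural = random_tokens(rng, 30, DIM)
functional_short = random_tokens(rng, 50, DIM)   # e.g. a shorter fMRI run
functional_long = random_tokens(rng, 120, DIM)   # e.g. a longer fMRI run

latent_short = fuse_with_hub_tokens(hub, structural, functional_short)
latent_long = fuse_with_hub_tokens(hub, structural, functional_long)

print(len(latent_short), len(latent_short[0]))
print(len(latent_long), len(latent_long[0]))
```

Both calls return 4 tokens of dimension 8, even though the two functional inputs differ in length. A trained model would add learned query, key, and value projections and multiple layers.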
Jun-13-2026, 21:32:57 GMT
- Industry:
- Health & Medicine
- Therapeutic Area > Neurology (1.00)
- Health Care Technology (1.00)
- Technology: