Improving Multimodal Brain Encoding Model with Dynamic Subject-awareness Routing
Yin, Xuanhua, Zhao, Runkai, Cai, Weidong
–arXiv.org Artificial Intelligence
Naturalistic fMRI encoding must handle multimodal inputs, shifting fusion styles, and pronounced inter-subject variability. We introduce AFIRE (Agnostic Framework for Multimodal fMRI Response Encoding), an agnostic interface that standardizes time-aligned post-fusion tokens from varied encoders, and MIND, a plug-and-play Mixture-of-Experts decoder with a subject-aware dynamic gating. Trained end-to-end for whole-brain prediction, AFIRE decouples the decoder from upstream fusion, while MIND combines token-dependent Top-K sparse routing with a subject prior to personalize expert usage without sacrificing generality. Experiments across multiple multimodal backbones and subjects show consistent improvements over strong baselines, enhanced cross-subject generalization, and interpretable expert patterns that correlate with content type. The framework offers a simple attachment point for new encoders and datasets, enabling robust, plug-and-improve performance for naturalistic neuroimaging studies.
arXiv.org Artificial Intelligence
Oct-13-2025
- Country:
- North America > United States (0.04)
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Genre:
- Research Report (0.40)
- Industry:
- Health & Medicine
- Health Care Technology (0.79)
- Therapeutic Area > Neurology (0.68)
- Health & Medicine
- Technology: