A generative framework to bridge data-driven models and scientific theories in language neuroscience