Mixture of Experts for Audio-Visual Learning Yang Li1,2 Junjie He1,2