Attention Bottlenecks for Multimodal Fusion - Supplementary Materials

Arsha Nagrani, Shan Yang, Anurag Arnab, Aren Jansen, Cordelia Schmid, Chen Sun

Neural Information Processing Systems 

Here we provide additional ablation results on mini-Audioset (Sec. We then provide results on two additional datasets, Moments in Time and Kinetics, in Sec. C, and perform some preliminary transfer learning experiments in Sec. E. Finally, we provide details on the AS-500K split.

In this section we expand on the ablations provided in Sec.