Robust Experts: the Effect of Adversarial Training on CNNs with Sparse Mixture-of-Experts Layers