[1605.06465] Swapout: Learning an ensemble of deep architectures (samples from dropout, ResNets, stochastic depth) • /r/MachineLearning