Swapout: Learning an ensemble of deep architectures

Neural Information Processing Systems 

We describe Swapout, a new stochastic training method, that outperforms ResNets of identical network structure yielding impressive results on CIFAR-10 and CIFAR-100.