Large-Scale Adversarial Training for Vision-and-Language Representation Learning

Neural Information Processing Systems 

To enable large-scale training, we adopt the "free" adversarial training strategy, and combine it with KL-divergence-based regularization to