Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes

Neural Information Processing Systems 

Training a large-scale deep neural network on a large-scale dataset is challenging and time-consuming. The recent breakthrough in large-batch optimization is a promising way to tackle this challenge.