How Parallelization and Large Batch Size Improve the Performance of Deep Neural Networks.