Layered SGD: A Decentralized and Synchronous SGD Algorithm for Scalable Deep Neural Network Training