GoSGD: Distributed Optimization for Deep Learning with Gossip Exchange