Faster SGD training by minibatch persistency
