Going faster than TensorFlow on the GPU with Clojure (GTX 1080Ti)

#artificialintelligence 

In the previous article, we have only compared the libraries on the CPU. Deep Diamond was considerably faster: 368 seconds vs 509 seconds. Most readers were intrigued, but, being skeptical as they should be, they complained that CPU performance doesn't matter anyway, since everybody uses GPU for training convolution networks; let's do the GPU comparison then. Both Deep Diamond, and Keras with TensorFlow, use Nvidia's cuDNN low level performance library under the hood, and any difference is due to the higher-level implementation. Deep Diamond completes this training in 21 seconds while Keras TensorFlow takes 35 seconds.