Comparison of non-linear activation functions for deep neural networks on MNIST classification task