A stochastic optimization approach to train non-linear neural networks with a higher-order variation regularization