ADASECANT: Robust Adaptive Secant Method for Stochastic Gradient