Fast Online Policy Gradient Learning with SMD Gain Vector Adaptation
Yu, Jin, Aberdeen, Douglas, Schraudolph, Nicol N.
–Neural Information Processing Systems
The stochastic meta--descent (SMD) gain adaptation algorithm [3, 4] can considerably accelerate the convergence of stochastic gradient descent.
Neural Information Processing Systems
Dec-31-2006