Fast Online Policy Gradient Learning with SMD Gain Vector Adaptation

Yu, Jin, Aberdeen, Douglas, Schraudolph, Nicol N.

Neural Information Processing Systems 

The stochastic meta-descent (SMD) gain adaptation algorithm [3, 4] can considerably accelerate the convergence of stochastic gradient descent.
