Fast Online Policy Gradient Learning with SMD Gain Vector Adaptation

Yu, Jin, Aberdeen, Douglas, Schraudolph, Nicol N.

Dec-31-2006–Neural Information Processing Systems

The stochastic meta--descent (SMD) gain adaptation algorithm [3, 4] can considerably accelerate the convergence of stochastic gradient descent.

artificial intelligence, gradient, machine learning, (14 more...)

Neural Information Processing Systems

Dec-31-2006

Conferences PDF

Country:
- Oceania > Australia (0.29)
- Europe > United Kingdom (0.28)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning > Gradient Descent (0.70)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.33)

Duplicate Docs Excel Report

Title
Fast Online Policy Gradient Learning with SMD Gain Vector Adaptation
Fast Online Policy Gradient Learning with SMD Gain Vector Adaptation

Similar Docs Excel Report more

Title	Similarity	Source
None found