Reviews: Total stochastic gradient algorithms and applications in reinforcement learning

Oct-7-2024, 05:01:13 GMT–Neural Information Processing Systems

This paper provides another formalism for gradient estimation in probabilistic computation graphs. Using pathwise derivative and likelihood ratio estimators, existing and well-known policy gradient theorems are cast into the proposed formalism. This intuition is then used to propose two new methods for gradient estimation that can be used in a model-based RL framework. Some results are shown that demonstrate comparable results to PILCO on the cart-pole task. Quality: the idea in this work is interesting, and the proposed framework and methods may prove useful in RL settings.

estimator, stochastic gradient algorithm and application, total stochastic gradient algorithm, (9 more...)

Neural Information Processing Systems

Oct-7-2024, 05:01:13 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Mathematical & Statistical Methods (0.40)
  - Machine Learning
    - Statistical Learning > Gradient Descent (0.40)
    - Reinforcement Learning (0.40)