Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

Apr-6-2023, 16:42:39 GMT–Neural Information Processing Systems

In reinforcement learning problems, the aim is to select a controller that will maximize the average reward in some environment.

cid, value function, variance, (14 more...)

Neural Information Processing Systems

Apr-6-2023, 16:42:39 GMT

Conferences Web Page

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (0.62)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.79)