Tinkering with Monte Carlo Method in Reinforcement Learning

Dec-18-2021, 09:45:58 GMT–#artificialintelligence

Monte Carlo, as well as Dynamic Programming, Temporal Difference are the main methods for starters in Reinforcement Learning. First, let's have a brief reminder of what is Monte Carlo method. Monte Carlo is an algorithm that generates paths (which constitutes an episode) based on the current policy which usually splits between exploration and exploitation, like epsilon greedy, until the path reaches a terminal state. Once that state is reached, the algorithm goes back through that path again and affects each state the discounted rewards that are met during the episode. These values (discounts rewards) are averaged with any other values that happen to be contained in those states.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

#artificialintelligence

Dec-18-2021, 09:45:58 GMT

News Web Page

Add feedback

Industry:
- Energy > Oil & Gas > Upstream (0.35)

Technology:
- Information Technology
  - Mathematics of Computing (0.92)
  - Artificial Intelligence > Machine Learning
    - Reinforcement Learning (0.91)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found