How to Train a Robot-Agent CartPole Using Q-Learning

Jul-11-2020, 01:36:45 GMT–#artificialintelligence

Q-learning is a model-free reinforcement learning algorithm to learn a policy telling an agent what action to take under what circumstances. It does not require a model of the environment, and it can handle problems with stochastic transitions and rewards, without requiring adaptations. For any finite Markov decision process (FMDP), Q-learning finds an optimal policy in the sense of maximizing the expected value of the total reward over any and all successive steps, starting from the current state. Q-learning can identify an optimal action-selection policy for any given FMDP, given infinite exploration time and a partly-random policy. "Q" names the function that returns the reward used to provide the reinforcement and can be said to stand for the "quality" of an action taken in a given state.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

#artificialintelligence

Jul-11-2020, 01:36:45 GMT

News Web Page

Add feedback

Industry:
- Leisure & Entertainment > Games > Computer Games (0.35)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks > Deep Learning (0.33)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found