RL -- Value Fitting & Q-Learning

Aug-16-2021, 15:35:14 GMT–#artificialintelligence

We can learn the value function and the Q-value function iteratively. In practice, when the state space is continuous or large, there is not enough memory to keep a table recording V(s) for every state. As in other deep learning methods, we can instead train a function approximator, most commonly a deep network, to estimate the value from the state.
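As a rough illustration of replacing the table with a fitted function, the sketch below fits V(s) on a continuous state by regressing toward Monte Carlo returns. Everything in it is an assumption made for the example, not something taken from the article: the toy environment (the state moves right by 0.1 per step and earns a reward of 1 until it reaches 1.0), the discount factor, the learning rate, and the helper names `features`, `value`, `rollout`, `discounted_return`, and `fit_value`. A linear model stands in for the deep network to keep the example short and dependency-free.

```python
import random

GAMMA = 0.9   # discount factor (assumed for this toy example)
STEP = 0.1    # how far the toy agent moves each step (assumed)
LR = 0.05     # learning rate for stochastic gradient descent (assumed)


def features(s):
    """Feature vector for a scalar state: a bias term plus the state itself."""
    return [1.0, s]


def value(w, s):
    """Linear approximation of V(s); a deep network would replace this."""
    return sum(wi * xi for wi, xi in zip(w, features(s)))


def rollout(s):
    """Hypothetical environment: move right by STEP, reward 1 per step,
    episode ends once the state reaches 1.0."""
    rewards = []
    while s < 1.0:
        s += STEP
        rewards.append(1.0)
    return rewards


def discounted_return(rewards):
    """Discounted sum of rewards, accumulated from the last step backwards."""
    g = 0.0
    for r in reversed(rewards):
        g = r + GAMMA * g
    return g


def fit_value(num_samples=20000, seed=0):
    """Fit the weights by regressing V(s) toward sampled Monte Carlo returns."""
    rng = random.Random(seed)
    w = [0.0, 0.0]
    for _ in range(num_samples):
        s = rng.random()                      # sample a start state in [0, 1)
        target = discounted_return(rollout(s))
        error = target - value(w, s)
        # Gradient step on the squared error 0.5 * error**2.
        for i, x in enumerate(features(s)):
            w[i] += LR * error * x
    return w


if __name__ == "__main__":
    w = fit_value()
    for s in (0.0, 0.5, 0.9):
        print(f"V({s}) is approximately {value(w, s):.2f}")
```

The two weights generalize across every state in [0, 1), which is the point of the approximation: no per-state entry is stored. Swapping the Monte Carlo target for a bootstrapped one (reward plus the discounted estimate of the next state) would turn the same loop into a temporal-difference update.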

monte-carlo method, q-learning, value learning



