RL -- Value Fitting & Q-Learning
The value function V(s) and the action-value function Q(s, a) can both be learned iteratively. When the state space is small, the estimates can be stored in a table with one entry per state. When the state space is large or continuous, no table can hold V(s) for every state, so we instead fit a function approximator, most commonly a deep network, that estimates the value directly from the state, as in other deep learning methods.
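As a rough illustration of replacing the table with a fitted function, here is a minimal sketch of Q-learning with a function approximator. Everything in it is an assumption made for the example and not taken from the article: a linear model over hand-picked features stands in for the deep network, the environment is a toy one-dimensional walk on [0, 1], and the names `features`, `q_value`, `q_update`, `step`, and `train` are hypothetical.

```python
import random


def features(state):
    # Hypothetical hand-picked features for a scalar state in [0, 1].
    return [1.0, state, state * state]


def q_value(weights, state, action):
    # Linear approximation: Q(s, a) = w_a . phi(s), one weight vector per action.
    return sum(w * f for w, f in zip(weights[action], features(state)))


def q_update(weights, state, action, reward, next_state, done,
             alpha=0.1, gamma=0.9):
    """Apply one semi-gradient Q-learning step in place; return the TD error."""
    if done:
        target = reward
    else:
        best_next = max(q_value(weights, next_state, a)
                        for a in range(len(weights)))
        target = reward + gamma * best_next
    td_error = target - q_value(weights, state, action)
    # For a linear model, the gradient of Q with respect to w_a is phi(s).
    for i, f in enumerate(features(state)):
        weights[action][i] += alpha * td_error * f
    return td_error


def step(state, action):
    # Toy environment (an assumption): action 1 moves right, action 0 moves
    # left, and the episode ends with reward 1 when the state reaches 1.0.
    move = 0.1 if action == 1 else -0.1
    next_state = min(1.0, max(0.0, state + move))
    done = next_state >= 1.0
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, epsilon=0.2, seed=0):
    rng = random.Random(seed)
    weights = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]  # two actions
    for _ in range(episodes):
        state = rng.random() * 0.9
        for _ in range(50):  # cap the episode length
            if rng.random() < epsilon:
                action = rng.randrange(2)  # explore
            else:
                action = max(range(2),
                             key=lambda a: q_value(weights, state, a))
            next_state, reward, done = step(state, action)
            q_update(weights, state, action, reward, next_state, done)
            state = next_state
            if done:
                break
    return weights
```

The weights here play the role the table would otherwise play: instead of storing one number per state, the model stores a handful of parameters and generalizes across nearby states. Swapping the linear model for a deep network would keep the same target, `reward + gamma * max Q(next_state, a)`, but compute the gradient by backpropagation.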
Aug-16-2021, 15:35:14 GMT