Artificial Intelligence IV - Reinforcement Learning in Java
This course is about reinforcement learning. It starts with the mathematical background: a Markov Decision Process (MDP) serves as the model for reinforcement learning. The problem can be solved in three ways: value iteration, policy iteration, and Q-learning. Q-learning is a model-free approach, which the course presents as the state-of-the-art option: it learns the optimal policy by interacting with the environment, without needing a model of the transitions and rewards in advance.
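To make the model-free idea concrete, here is a minimal sketch of tabular Q-learning in Java. It is not taken from the course material: the five-cell corridor environment, the class name `QLearningSketch`, the helper methods, and all hyperparameter values are assumptions chosen for illustration. The agent starts in cell 0, can move left or right, and receives a reward of 1 only when it reaches cell 4.

```java
import java.util.Random;

// Illustrative sketch only: the corridor environment and all names are hypothetical.
public class QLearningSketch {
    static final int N_STATES = 5;          // corridor cells 0..4; cell 4 is the goal
    static final int LEFT = 0, RIGHT = 1;   // the two available actions

    // Deterministic transition: move one cell, clamped to the corridor bounds.
    static int step(int state, int action) {
        int next = (action == RIGHT) ? state + 1 : state - 1;
        return Math.max(0, Math.min(N_STATES - 1, next));
    }

    // Reward of 1 for arriving at the goal cell, 0 otherwise.
    static double reward(int nextState) {
        return nextState == N_STATES - 1 ? 1.0 : 0.0;
    }

    // Greedy action with random tie-breaking, so the untrained agent still explores.
    static int greedy(double[][] q, int s, Random rng) {
        if (q[s][LEFT] == q[s][RIGHT]) return rng.nextInt(2);
        return q[s][RIGHT] > q[s][LEFT] ? RIGHT : LEFT;
    }

    // Runs epsilon-greedy Q-learning and returns the learned Q-table.
    static double[][] train(int episodes, double alpha, double gamma,
                            double epsilon, long seed) {
        Random rng = new Random(seed);
        double[][] q = new double[N_STATES][2];   // all Q-values start at 0
        for (int e = 0; e < episodes; e++) {
            int s = 0;
            // Cap the episode length so training always terminates.
            for (int t = 0; t < 200 && s != N_STATES - 1; t++) {
                int a = rng.nextDouble() < epsilon ? rng.nextInt(2) : greedy(q, s, rng);
                int next = step(s, a);
                boolean terminal = next == N_STATES - 1;
                // Q-learning target: r + gamma * max_a' Q(s', a'), with no bootstrap at the goal.
                double best = Math.max(q[next][LEFT], q[next][RIGHT]);
                double target = reward(next) + (terminal ? 0.0 : gamma * best);
                q[s][a] += alpha * (target - q[s][a]);
                s = next;
            }
        }
        return q;
    }

    public static void main(String[] args) {
        double[][] q = train(500, 0.5, 0.9, 0.2, 42L);
        for (int s = 0; s < N_STATES - 1; s++) {
            String best = q[s][RIGHT] > q[s][LEFT] ? "RIGHT" : "LEFT";
            System.out.println("state " + s + ": " + best);
        }
    }
}
```

The update never consults the transition function to plan ahead: `step` is called only to sample what the environment does next, which is what distinguishes this from value iteration and policy iteration, both of which sweep over a known model.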
Apr-8-2018, 16:39:14 GMT