Let's see some simple example which helps you to illustrate the reinforcement learning mechanism. Consider the scenario of teaching new tricks to your cat. There are three approaches to implement a Reinforcement Learning algorithm. In a value-based Reinforcement Learning method, you should try to maximize a value function V(s). In this method, the agent is expecting a long-term return of the current states under policy π.
Mar-11-2021, 04:10:34 GMT