Reinforcement Learning Intro: Markov Decision Process
In most modern literature, the learning system is called an agent. The term covers more than just the neural network estimator used in recent algorithms. The agent interacts with an environment to solve a task. We'll go into more detail later on what constitutes the environment, since that question can become problematic. The environment is defined by two main components: its transition function and its reward function.
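To make the agent/environment split concrete, here is a minimal sketch of an interaction loop. The corridor environment, the names `transition`, `reward`, `random_agent`, and `run_episode`, and the choice of a deterministic transition function are all assumptions made for illustration; they are not taken from the article.

```python
import random

# Hypothetical toy environment: a corridor with states 0..4, goal at state 4.
N_STATES = 5
ACTIONS = (-1, +1)  # move left or move right


def transition(state, action):
    """Transition function: next state for a given state and action.

    Deterministic here for simplicity; in general it is a probability
    distribution over next states.
    """
    return min(max(state + action, 0), N_STATES - 1)


def reward(state, action, next_state):
    """Reward function: 1.0 when the goal state is reached, else 0.0."""
    return 1.0 if next_state == N_STATES - 1 else 0.0


def random_agent(state, rng):
    """Placeholder agent: ignores the state and acts uniformly at random."""
    return rng.choice(ACTIONS)


def run_episode(agent, max_steps=100, seed=0):
    """Run one episode and return (total reward, number of steps taken)."""
    rng = random.Random(seed)
    state, total_reward, steps = 0, 0.0, 0
    for _ in range(max_steps):
        action = agent(state, rng)              # agent chooses an action
        next_state = transition(state, action)  # environment moves
        total_reward += reward(state, action, next_state)
        state = next_state
        steps += 1
        if state == N_STATES - 1:               # goal reached, episode ends
            break
    return total_reward, steps


if __name__ == "__main__":
    print(run_episode(random_agent))
```

The agent here is only a function from state to action; a learning agent would also update itself from the rewards it observes, which this sketch leaves out.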
Aug-16-2022, 16:29:52 GMT