Advanced Reinforcement Learning: policy gradient methods
Sample efficiency for policy gradient methods is pretty poor. We throw out each batch of data immediately after just one gradient step. This is the most complete Reinforcement Learning course series on Udemy. In it, you will learn to implement some of the most powerful Deep Reinforcement Learning algorithms in Python using PyTorch and PyTorch lightning. You will implement from scratch adaptive algorithms that solve control tasks based on experience.
May-21-2022, 13:44:04 GMT