Enhancing Policy Gradient with the Polyak Step-Size Adaption
