A Self-Tuning Actor-Critic Algorithm

Neural Information Processing Systems

In this paper, we take a step towards addressing this issue by using meta-gradient descent (Xu et al., 2018) to adapt hyperparameters automatically and online.
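
To illustrate the general idea of adapting a hyperparameter online with a meta-gradient, here is a minimal toy sketch. It is not the paper's algorithm: the quadratic losses, the function name `self_tune`, and the parameters `alpha`, `beta`, and `target` are all assumptions made for illustration. A hyperparameter `eta` shapes the inner loss, and it is updated by differentiating an outer loss through one inner update of `theta`.

```python
def self_tune(target=3.0, alpha=0.5, beta=0.5, steps=200):
    """Toy meta-gradient loop (hypothetical example, not the paper's method).

    Inner loss:  L_in(theta; eta) = 0.5 * (theta - eta) ** 2
    Outer loss:  L_out(theta')    = 0.5 * (theta' - target) ** 2
    """
    theta, eta = 0.0, 0.0
    for _ in range(steps):
        # Inner step: gradient descent on the inner loss, which depends on eta.
        inner_grad = theta - eta
        theta_new = theta - alpha * inner_grad

        # Meta-gradient via the chain rule:
        # dL_out/d eta = dL_out/d theta_new * d theta_new/d eta
        outer_grad = theta_new - target
        dtheta_new_deta = alpha  # from theta_new = theta - alpha * (theta - eta)
        eta -= beta * outer_grad * dtheta_new_deta

        theta = theta_new
    return theta, eta


theta, eta = self_tune()
```

In this toy setting both `theta` and `eta` approach `target`, because the outer loss is minimized only when the hyperparameter steers the inner update to the right value. In an actor-critic agent the role of `eta` would be played by hyperparameters of the learning update, and the derivatives would typically come from automatic differentiation rather than being written by hand.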