A Self-Tuning Actor-Critic Algorithm
–Neural Information Processing Systems
In this paper, we take a step towards addressing this issue by using metagradients to automatically adapt hyperparameters online by meta-gradient descent (Xu et al., 2018).
Neural Information Processing Systems
Aug-17-2025, 05:18:45 GMT