Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control

Open in new window