Evolved Policy Gradients
Rein Houthooft, Yuhua Chen, Phillip Isola, Bradly Stadie, Filip Wolski, OpenAI Jonathan Ho, Pieter Abbeel
–Neural Information Processing Systems
We propose a metalearning approach for learning gradient-based reinforcement learning (RL) algorithms.
Neural Information Processing Systems
Nov-20-2025, 17:33:33 GMT