Evolved Policy Gradients
Rein Houthooft, Yuhua Chen, Phillip Isola, Bradly Stadie, Filip Wolski, OpenAI Jonathan Ho, Pieter Abbeel
–Neural Information Processing Systems
We propose a metalearning approach for learning gradient-based reinforcement learning (RL) algorithms.
Neural Information Processing Systems
Nov-18-2025, 01:06:56 GMT