Evolved Policy Gradients

Rein Houthooft, Yuhua Chen, Phillip Isola, Bradly Stadie, Filip Wolski, OpenAI Jonathan Ho, Pieter Abbeel

Neural Information Processing Systems 

We propose a metalearning approach for learning gradient-based reinforcement learning (RL) algorithms.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found