Genetic-Gated Networks for Deep Reinforcement Learning

Simyung Chang, John Yang, Jaeseok Choi, Nojun Kwak

Neural Information Processing Systems 

Exploitation of short-sighted gradients should be balanced with adequate exploration. Exploration should therefore be designed to be independent of the policy gradients, so that it can guide the policy toward unseen states.
