Genetic-Gated Networks for Deep Reinforcement Learning
Simyung Chang, John Yang, Jaeseok Choi, Nojun Kwak
–Neural Information Processing Systems
Exploiting the short-sighted gradients should be balanced with adequate explorations. Explorations thus should be designed irrelevant to policy gradients in order to guide the policy to unseen states.
Neural Information Processing Systems
Nov-18-2025, 19:46:54 GMT