EfficientSchedulingofDataAugmentation forDeepReinforcementLearning

Neural Information Processing Systems 

However,evenwhentheprior is useful for generalization, distilling it to RL agent often interferes with RL training and degenerates sample efficiency.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found