EfficientSchedulingofDataAugmentation forDeepReinforcementLearning
–Neural Information Processing Systems
However,evenwhentheprior is useful for generalization, distilling it to RL agent often interferes with RL training and degenerates sample efficiency.
Neural Information Processing Systems
Feb-12-2026, 04:48:26 GMT