ReincarnatingReinforcementLearning: ReusingPriorComputationtoAccelerateProgress

Neural Information Processing Systems 

The vertical separators correspond to loading network weights and replay buffer for fine-tuning while offline pre-training on replay buffer using QDagger (Section 4.1) for reincarnation. Shaded regions show 95% confidence intervals.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found