Variance Reduction via Resampling and Experience Replay