ROER: Regularized Optimal Experience Replay