Expected Sarsa($\lambda$) with Control Variate for Variance Reduction

Open in new window