Unbiased Estimation of the Value of an Optimized Policy