Predicting Long Term Sequential Policy Value Using Softer Surrogates