Reliable Selection of Heterogeneous Treatment Effect Estimators