We Have It Covered: A Resampling-based Method for Uplift Model Comparison