Probabilistic measures afford fair comparisons of AIWP and NWP model output