Measuring model variability using robust non-parametric testing