Mitigating bias in calibration error estimation