Calibration tests in multi-class classification: A unifying framework