Monitoring the calibration of probability forecasts with an application to concept drift detection involving image classification