Improving Deep Ensembles by Estimating Confusion Matrices