A Controlled Reevaluation of Coreference Resolution Models