Don't Label Twice: Quantity Beats Quality when Comparing Binary Classifiers on a Budget