Active Bayesian Assessment for Black-Box Classifiers