Spectral Methods Meet EM: A Provably Optimal Algorithm for Crowdsourcing Xi Chen Dengyong Zhou

Neural Information Processing Systems 

The Dawid-Skene estimator has been widely used for inferring the true labels from the noisy labels provided by non-expert crowdsourcing workers. However, since the estimator maximizes a non-convex log-likelihood function, it is hard to theoretically justify its performance. In this paper, we propose a two-stage efficient algorithm for multi-class crowd labeling problems. The first stage uses the spectral method to obtain an initial estimate of parameters.