Learning from the Wisdom of Crowds by Minimax Entropy

Zhou, Dengyong, Basu, Sumit, Mao, Yi, Platt, John C.

Neural Information Processing Systems 

An important way to make large training sets is to gather noisy labels from crowds of nonexperts. We propose a minimax entropy principle to improve the quality of these labels. Our method assumes that labels are generated by a probability distribution over workers, items, and labels. By maximizing the entropy of this distribution, the method naturally infers item confusability and worker expertise. We infer the ground truth by minimizing the entropy of this distribution, which we show minimizes the Kullback-Leibler (KL) divergence between the probability distribution and the unknown truth. We show that a simple coordinate descent scheme can optimize minimax entropy. Empirically, our results are substantially better than previously published methods for the same problem.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found