Prospect Theory Based Crowdsourcing for Classification in the Presence of Spammers

Geng, Baocheng, Li, Qunwei, Varshney, Pramod K.

arXiv.org Machine Learning 

--We consider the M -ary classification problem via crowdsourcing, where crowd workers respond to simple binary questions and the answers are aggregated via decision fusion. The workers have a reject option to skip answering a question when they do not have the expertise, or when the confidence of answering that question correctly is low. We further consider that there are spammers in the crowd who respond to the questions with random guesses. Under the payment mechanism that encourages the reject option, we study the behavior of honest workers and spammers, whose objectives are to maximize their monetary rewards. T o accurately characterize human behavioral aspects, we employ prospect theory to model the rationality of the crowd workers, whose perception of costs and probabilities are distorted based on some value and weight functions, respectively. Moreover, we estimate the number of spammers and employ a weighted majority voting decision rule, where we assign an optimal weight for every worker to maximize the system performance. The probability of correct classification and asymptotic system performance are derived. We also provide simulation results to demonstrate the effectiveness of our approach. ROWDSOURCING has attracted intense interest in recent years as a new paradigm for distributed inference. Crowdsourcing enables a new framework to utilize distributed human wisdom to solve problems that machines cannot perform well, like handwriting recognition, anomaly detection, voice transcription, and image labelling [8]-[11]. While conventional group collaboration and cooperation frameworks rely heavily on a collection of experts in related fields, the crowd in crowdsourcing usually consists of non-experts. Therefore, the responses obtained from the crowd have diverse quality levels, which makes decision fusion in the problem of classification via crowdsourcing quite challenging. Although crowdsourcing has been applied in many applications, the quality of the aggregated result is relatively low [12]-[14] due to the following reasons. First, the worker pool is anonymous in nature, which may result in an unskilled and unreliable crowd [15].

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found