When majority voting fails: Comparing quality assurance methods for noisy human computation environment