What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?