Statistical modality tagging from rule-based annotations and crowdsourcing