Statistical modality tagging from rule-based annotations and crowdsourcing

Prabhakaran, Vinodkumar, Bloodgood, Michael, Diab, Mona, Dorr, Bonnie, Levin, Lori, Piatko, Christine D., Rambow, Owen, Van Durme, Benjamin

Mar-3-2015–arXiv.org Machine Learning

We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automatically training a modality tagger where we first gathered sentences based on a high-recall simple rule-based modality tagger and then provided these sentences to Mechanical Turk annotators for further annotation. We used the resulting set of training data to train a precise modality tagger using a multi-class SVM that delivers good performance.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

Mar-3-2015

arXiv.org PDF

Add feedback

Country:
- Asia (1.00)
- North America > United States
  - Colorado (0.14)
  - Oregon (0.14)
  - Maryland (0.14)

Genre:
- Research Report (0.82)

Technology:
- Information Technology
  - Communications > Social Media
    - Crowdsourcing (0.70)
  - Artificial Intelligence
    - Representation & Reasoning > Rule-Based Reasoning (1.00)
    - Machine Learning (1.00)
    - Natural Language > Text Processing (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found