Interactive Semantic Featuring for Text Classification
Camille Jandot, Patrice Simard, Max Chickering, David Grangier, Jina Suh
In text classification, dictionaries can be used to define human-comprehensible features. We propose an improvement to dictionary features called smoothed dictionary features. These features recognize document contexts instead of n-grams. We describe a principled methodology to solicit dictionary features from a teacher, and present results showing that models built using these human-comprehensible features are competitive with models trained with Bag of Words features.
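To make the baseline concrete, here is a minimal sketch of a plain (unsmoothed) dictionary feature next to a bag-of-words representation. It does not implement the smoothed dictionary features the paper proposes, since the abstract does not define them. The function names, the tokenizer, the `SPORTS` dictionary, and the sample sentence are all hypothetical choices made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def dictionary_feature(text, dictionary):
    """Count the tokens of `text` that appear in `dictionary`.

    A single number per dictionary is easy for a person to read:
    "this document contains 4 words from the sports dictionary".
    """
    words = {w.lower() for w in dictionary}
    return sum(1 for tok in tokenize(text) if tok in words)


def bag_of_words(text):
    """Map every distinct token of `text` to its count."""
    return Counter(tokenize(text))


# Hypothetical dictionary and document, for illustration only.
SPORTS = {"goal", "match", "striker", "referee"}
doc = "The striker scored a late goal, and the referee ended the match."

sports_count = dictionary_feature(doc, SPORTS)
word_counts = bag_of_words(doc)
```

The dictionary feature collapses the document into one interpretable value per dictionary, whereas bag of words yields one dimension per distinct token. A plain dictionary feature like this one only fires on exact term matches, which is the limitation that recognizing document contexts is meant to address.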
Jun-23-2016
- Country:
- North America > United States > Wisconsin (0.14)
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Education (0.47)