Labeled Morphological Segmentation with Semi-Markov Models
Cotterell, Ryan, Müller, Thomas, Fraser, Alexander, Schütze, Hinrich
–arXiv.org Artificial Intelligence
We present labeled morphological segmentation, an alternative view of morphological processing that unifies several tasks. From an annotation standpoint, we additionally introduce a new hierarchy of morphotactic tagsets. Finally, we develop \modelname, a discriminative morphological segmentation system that, contrary to previous work, explicitly models morphotactics. We show that \textsc{chipmunk} yields improved performance on three tasks for all six languages: (i) morphological segmentation, (ii) stemming and (iii) morphological tag classification. On morphological segmentation, our method shows absolute improvements of 2--6 points $F_1$ over the baseline.
arXiv.org Artificial Intelligence
Apr-13-2024
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America > United States
- Massachusetts (0.04)
- Maryland (0.04)
- Washington > King County
- Seattle (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- New York > Monroe County
- Rochester (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Colorado > Boulder County
- Boulder (0.04)
- Europe
- Sweden
- Uppsala County > Uppsala (0.04)
- Vaestra Goetaland > Gothenburg (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Sweden
- Asia
- Africa > Middle East
- Morocco (0.04)
- Oceania > Australia
- Genre:
- Research Report (0.64)
- Technology: