Feature-Rich Part-of-speech Tagging for Morphologically Complex Languages: Application to Bulgarian
Georgiev, Georgi, Zhikov, Valentin, Osenova, Petya, Simov, Kiril, Nakov, Preslav
–arXiv.org Artificial Intelligence
Unlike most previous work, which has used a small number of grammatical categories, we work with 680 morpho-syntactic tags. W e combine a large morphological lexicon with prior linguistic knowledge and guided learning from a POSannotated corpus, achieving accuracy of 97.98%, which is a significant improvement over the state-of-the-art for Bulgarian.
arXiv.org Artificial Intelligence
Nov-26-2019
- Country:
- North America
- United States
- Rhode Island > Providence County
- Providence (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- California > San Francisco County
- San Francisco (0.14)
- Rhode Island > Providence County
- Canada
- British Columbia (0.04)
- Quebec > Montreal (0.04)
- Alberta > Census Division No. 11
- Edmonton Metropolitan Region > Edmonton (0.04)
- United States
- Europe
- Czechia > Prague (0.05)
- Bulgaria
- Sofia City Province > Sofia (0.04)
- Plovdiv Province > Plovdiv (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- Slovakia > Bratislava
- Bratislava (0.04)
- Hungary > Budapest
- Budapest (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- United Kingdom > England
- Greater Manchester > Manchester (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Norway > Eastern Norway
- Oslo (0.04)
- France > Occitanie
- Haute-Garonne > Toulouse (0.04)
- Asia
- Taiwan > Taiwan Province
- Taipei (0.04)
- Middle East > Qatar
- China > Beijing
- Beijing (0.04)
- Taiwan > Taiwan Province
- North America
- Genre:
- Research Report (0.82)
- Technology: