Transition-Based Neural Word Segmentation Using Word-Level Features
Meishan Zhang, Yue Zhang, Guohong Fu
Journal of Artificial Intelligence Research
Character-based and word-based methods are two different solutions for Chinese word segmentation: the former exploits sequence labeling models over characters, and the latter uses word-level features. Neural models have been applied to character-based Chinese word segmentation, achieving high accuracy by making use of external character embeddings while requiring less feature engineering. In this paper, we study a neural model for word-based Chinese word segmentation, replacing the manually designed discrete features with neural features in a transition-based word segmentation framework. Experimental results demonstrate that word features lead to performance comparable to the best systems in the literature, and that a further combination of discrete and neural features obtains top accuracies on several benchmarks.
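To make the transition-based framing concrete, the following is a minimal sketch of how a segmentation can be built by processing characters left to right with two actions. The action names `SEP` (start a new word) and `APP` (append to the current word), and the helper names `apply_actions` and `oracle_actions`, are illustrative assumptions not taken from the abstract. The sketch covers only the state transitions; it does not include the neural or discrete scoring model that would choose among actions.

```python
from typing import List

# Assumed action labels for illustration; the abstract does not name them.
SEP = "SEP"  # start a new word with the current character
APP = "APP"  # append the current character to the last word


def apply_actions(chars: str, actions: List[str]) -> List[str]:
    """Replay one action per character and return the resulting words."""
    if len(chars) != len(actions):
        raise ValueError("need exactly one action per character")
    words: List[str] = []
    for ch, act in zip(chars, actions):
        if act not in (SEP, APP):
            raise ValueError(f"unknown action: {act!r}")
        # The first character must always open a word, whatever the action.
        if act == SEP or not words:
            words.append(ch)
        else:
            words[-1] += ch
    return words


def oracle_actions(words: List[str]) -> List[str]:
    """Derive the action sequence that reproduces a gold segmentation."""
    actions: List[str] = []
    for word in words:
        if not word:
            raise ValueError("words must be non-empty")
        actions.append(SEP)
        actions.extend([APP] * (len(word) - 1))
    return actions


if __name__ == "__main__":
    gold = ["中国", "人民"]
    acts = oracle_actions(gold)
    print(acts)
    print(apply_actions("".join(gold), acts))
```

In a full system, a scorer would rank `SEP` against `APP` at each step using features of the partially built word sequence, which is where word-level features enter; the sketch leaves that component out.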
Dec-23-2018