Parsing Thai Social Data: A New Challenge for Thai NLP
Singkul, Sattaya, Khampingyot, Borirat, Maharattamalai, Nattasit, Taerungruang, Supawat, Chalothorn, Tawunrat
–arXiv.org Artificial Intelligence
Dependency parsing (DP) is a task that analyzes text for syntactic structure and relationship between words. DP is widely used to improve natural language processing (NLP) applications in many languages such as English. Previous works on DP are generally applicable to formally written languages. However, they do not apply to informal languages such as the ones used in social networks. Therefore, DP has to be researched and explored with such social network data. In this paper, we explore and identify a DP model that is suitable for Thai social network data. After that, we will identify the appropriate linguistic unit as an input. The result showed that, the transition based model called, improve Elkared dependency parser outperform the others at UAS of 81.42%.
arXiv.org Artificial Intelligence
Mar-6-2020
- Country:
- North America
- United States
- Maryland > Baltimore (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- New York
- Richmond County > New York City (0.04)
- Queens County > New York City (0.04)
- New York County > New York City (0.04)
- Kings County > New York City (0.04)
- Bronx County > New York City (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Georgia > Clarke County
- Athens (0.04)
- Canada > British Columbia
- United States
- Europe
- Germany > Berlin (0.04)
- Slovenia (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Spain
- Galicia > Madrid (0.04)
- Valencian Community > Valencia Province
- Valencia (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Italy > Liguria
- Genoa (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Switzerland > Geneva
- Geneva (0.04)
- Denmark > North Jutland
- Aalborg (0.04)
- Asia
- South Korea (0.04)
- Singapore (0.04)
- Thailand > Chiang Mai
- Chiang Mai (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Middle East
- Republic of Türkiye > Istanbul Province
- Istanbul (0.04)
- Qatar > Ad-Dawhah
- Doha (0.04)
- Republic of Türkiye > Istanbul Province
- Japan > Honshū
- Kansai > Osaka Prefecture > Osaka (0.04)
- China > Beijing
- Beijing (0.04)
- North America
- Genre:
- Research Report > New Finding (0.54)
- Industry:
- Information Technology (1.00)
- Technology: