Automatic Wordnet Development for Low-Resource Languages using Cross-Lingual WSD
Taghizadeh, Nasrin, Faili, Hesham
–Journal of Artificial Intelligence Research
Wordnets are an effective resource for natural language processing and information retrieval, especially for semantic processing and meaning related tasks. So far, wordnets have been constructed for many languages. However, the automatic development of wordnets for low-resource languages has not been well studied. In this paper, an Expectation-Maximization algorithm is used to create high quality and large scale wordnets for poorresource languages. The proposed method benefits from possessing cross-lingual word sense disambiguation and develops a wordnet by only using a bi-lingual dictionary and a monolingual corpus. The proposed method has been executed with Persian language and the resulting wordnet has been evaluated through several experiments. The results show that the induced wordnet has a precision score of 90% and a recall score of 35%.
Journal of Artificial Intelligence Research
May-20-2016
- Country:
- North America > United States (0.04)
- Europe
- Sweden > Uppsala County
- Uppsala (0.04)
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Middle East > Malta
- Port Region > Southern Harbour District > Valletta (0.04)
- Italy
- Liguria > Genoa (0.04)
- Emilia-Romagna > Metropolitan City of Bologna
- Bologna (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Sweden > Uppsala County
- Asia
- South Korea (0.04)
- Thailand > Phuket
- Phuket (0.04)
- Middle East
- UAE
- Sharjah Emirate > Sharjah (0.04)
- Dubai Emirate > Dubai (0.04)
- Iran > Tehran Province
- Tehran (0.04)
- UAE
- India
- Maharashtra > Mumbai (0.04)
- NCT
- China
- Heilongjiang Province > Harbin (0.04)
- Beijing > Beijing (0.04)
- Africa > Middle East
- Morocco (0.04)
- Egypt > Cairo Governorate
- Cairo (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Workflow (0.93)