NEMO: Frequentist Inference Approach to Constrained Linguistic Typology Feature Prediction in SIGTYP 2020 Shared Task
Gutkin, Alexander, Sproat, Richard
–arXiv.org Artificial Intelligence
This paper describes the NEMO submission to SIGTYP 2020 shared task which deals with prediction of linguistic typological features for multiple languages using the data derived from World Atlas of Language Structures (WALS). We employ frequentist inference to represent correlations between typological features and use this representation to train simple multi-class estimators that predict individual features. We describe two submitted ridge regression-based configurations which ranked second and third overall in the constrained task. Our best configuration achieved the micro-averaged accuracy score of 0.66 on 149 test languages.
arXiv.org Artificial Intelligence
Oct-12-2020
- Country:
- Oceania > Australia
- North America
- Central America (0.04)
- United States
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Colorado > Boulder County
- Boulder (0.04)
- California > San Francisco County
- San Francisco (0.04)
- Minnesota > Hennepin County
- Europe
- Czechia > Prague (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Cambridgeshire > Cambridge (0.04)
- Switzerland > Zürich
- Zürich (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Slovenia > Drava
- Municipality of Benedikt > Benedikt (0.04)
- Italy > Tuscany
- Florence (0.04)
- Germany > Saxony
- Leipzig (0.04)
- Asia
- India (0.04)
- South Korea (0.04)
- Nepal (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Africa
- Niger (0.04)
- East Africa (0.04)
- Genre:
- Research Report > New Finding (0.47)