Exploring Linguistic Probes for Morphological Generalization
Kodner, Jordan, Khalifa, Salam, Payne, Sarah
–arXiv.org Artificial Intelligence
SIGMORPHON and SIGMORPHON-UniMorph Three languages were chosen whose inflectional shared tasks (Cotterell et al., 2016, 2017, 2018; morphologies range from entirely fusional (English), McCarthy et al., 2019; Vylomova et al., 2020; Pimentel to mixed (Spanish), to mostly agglutinative et al., 2021; Kodner et al., 2022) as well (Swahili). In highly agglutinative languages, individual as in more targeted studies focused on specific languages features in a set tend to correspond to distinct or the generalization behavior of computational morphological patterns, so a model may generalize models (Goldman et al., 2022; Wiemerslage to unseen feature sets by mapping component et al., 2022; Kodner et al., 2023b; Guriel et al., features to their corresponding patterns. This is 2023; Kodner et al., 2023a), is to train on (lemma, exemplified by the Swahili example (1), in which inflection, features) triples and predict inflected most features correspond to individual morphemes; forms from held-out (lemma, features) only the person/number prefix maps to more than pairs.
arXiv.org Artificial Intelligence
Oct-20-2023
- Country:
- Oceania > Australia (0.04)
- North America
- United States
- Washington > King County
- Seattle (0.04)
- New York > Suffolk County
- Stony Brook (0.04)
- Washington > King County
- Canada > Ontario
- Toronto (0.04)
- United States
- Europe
- Germany > Berlin (0.04)
- Italy > Tuscany
- Florence (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Hungary > Jász-Nagykun-Szolnok County
- Szolnok (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Asia
- Middle East
- Jordan (0.05)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- Middle East
- Genre:
- Research Report (0.64)
- Technology: