Unsupervised Protoform Reconstruction through Parsimonious Rule-guided Heuristics and Evolutionary Search
–arXiv.org Artificial Intelligence
We propose an unsupervised method for the reconstruction of protoforms i.e., ancestral word forms from which modern language forms are derived. While prior work has primarily relied on probabilistic models of phonological edits to infer protoforms from cognate sets, such approaches are limited by their p redominantly data - driven nature. In contrast, our model integrates data - driven inference with rule - based heuristics within an evolutionary optimization framework. This hybrid approach leverages on both statistical patterns and linguistically motivat ed constraints to guide the reconstruction process. We evaluate our method on the task of reconstructing Latin protoforms using a dataset of cognates from five Romance languages. Experimental results demonstrate substantial improvements over established ba selines across both character - level accuracy and phonological plausibility metrics. Keywords: protoform reconstruction, historical linguistics, evolutionary algorithms, phonological modeling, rule - based inference .
arXiv.org Artificial Intelligence
Jun-13-2025
- Country:
- Europe
- Sweden (0.04)
- Switzerland > Basel-City
- Basel (0.04)
- North America > Canada
- Europe
- Genre:
- Research Report > New Finding (0.34)
- Technology: