A Paradigm Gap in Urdu

Adeeba, Farah, Bhatt, Rajesh

arXiv.org Artificial Intelligence 

In this paper, we document a paradigm gap in the combinatorial possibilities of verbs and aspect in Urdu: the perfective form of the -y a: kar construction (e.g., ro-y a: ki: 'cry-Pfv do.Pfv') is sharply ungrammatical in modern Urdu and Hindi, despite being freely attested in 19th-century literature. We investigate this diachronic shift through historical text analysis, a large-scale corpus study--which confirms the stark absence of perfective forms--and subjective evaluation tasks with native speakers, who judge perfective examples as highly unnatural. We argue that this gap arose from a fundamental morphosyntactic conflict: the construction's requirement for a nominative subject and an invariant participle clashes with the core grammatical rule that transitive perfectives assign ergative case. This conflict rendered the perfective form unstable, and its functional replacement by other constructions allowed the gap to become entrenched in the modern grammar. 1 Introduction Human languages are dynamic systems that continually evolve, resulting in the emergence, change, and sometimes complete disappearance of morphological and grammatical structures. Within this diachronic landscape, the phenomenon of paradigm gaps --systematic absences of expected word forms or constructions--presents a particularly intriguing puzzle for linguistic theory (Albright 2003, Sims 2006, Bermel & Knittl 2012). Such gaps challenge models of language production, acquisition, and change, as they represent a failure of the grammar to generate a logically possible form.