Still Not There: Can LLMs Outperform Smaller Task-Specific Seq2Seq Models on the Poetry-to-Prose Conversion Task?
Das, Kunal Kingkar, Jagadeeshan, Manoj Balaji, Sahith, Nallani Chakravartula, Sandhan, Jivnesh, Goyal, Pawan
–arXiv.org Artificial Intelligence
Large Language Models (LLMs) are increasingly treated as universal, general-purpose solutions across NLP tasks, particularly in English. But does this assumption hold for low-resource, morphologically rich languages such as Sanskrit? We address this question by comparing instruction-tuned and in-context-prompted LLMs with smaller task-specific encoder-decoder models on the Sanskrit poetry-to-prose conversion task. This task is intrinsically challenging: Sanskrit verse exhibits free word order combined with rigid metrical constraints, and its conversion to canonical prose (anvaya) requires multi-step reasoning involving compound segmentation, dependency resolution, and syntactic linearisation. This makes it an ideal testbed to evaluate whether LLMs can surpass specialised models. For LLMs, we apply instruction fine-tuning on general-purpose models and design in-context learning templates grounded in Paninian grammar and classical commentary heuristics. For task-specific modelling, we fully fine-tune a ByT5-Sanskrit Seq2Seq model. Our experiments show that domain-specific fine-tuning of ByT5-Sanskrit significantly outperforms all instruction-driven LLM approaches. Human evaluation strongly corroborates this result, with scores exhibiting high correlation with Kendall's Tau scores. Additionally, our prompting strategies provide an alternative to fine-tuning when domain-specific verse corpora are unavailable, and the task-specific Seq2Seq model demonstrates robust generalisation on out-of-domain evaluations.
arXiv.org Artificial Intelligence
Nov-12-2025
- Country:
- Asia
- China (0.04)
- India
- Karnataka > Bengaluru (0.04)
- West Bengal > Kharagpur (0.04)
- Japan > Honshū
- Kansai
- Kyoto Prefecture > Kyoto (0.04)
- Osaka Prefecture > Osaka (0.04)
- Kansai
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Nepal > Bagmati Province
- Kathmandu District > Kathmandu (0.04)
- Singapore (0.04)
- Europe > Croatia
- Dubrovnik-Neretva County > Dubrovnik (0.04)
- North America
- Canada > Ontario
- Toronto (0.04)
- United States
- California > Santa Clara County
- Palo Alto (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- South Carolina (0.04)
- California > Santa Clara County
- Canada > Ontario
- Asia
- Genre:
- Research Report > New Finding (0.68)
- Technology: