SAGE:Specification-Aware Grammar Extraction for Automated Test Case Generation with LLMs
Aditi, null, Park, Hyunwoo, Sung, Sicheol, Han, Yo-Sub, Ko, Sang-Ki
–arXiv.org Artificial Intelligence
Grammar-based test case generation has proven effective for competitive programming problems, but generating valid and general grammars from natural language specifications remains a key challenge, especially under limited supervision. Context-Free Grammars with Counters (CCFGs) have recently been introduced as a formalism to represent such specifications with logical constraints by storing and reusing counter values during derivation. In this work, we explore the use of open-source large language models (LLMs) to induce CCFGs from specifications using a small number of labeled examples and verifiable reward-guided reinforcement learning. Our approach first fine-tunes an open-source LLM to perform specification-to-grammar translation, and further applies Group Relative Policy Optimization (GRPO) to enhance grammar validity and generality. We also examine the effectiveness of iterative feedback for open and closed-source LLMs in correcting syntactic and semantic errors in generated grammars. Experimental results show that our approach SAGE achieves stronger generalization and outperforms 17 open and closed-source LLMs in both grammar quality and test effectiveness, improving over the state-of-the-art by 15.92%p in grammar validity and 12.34%p in test effectiveness. We provide our implementation and dataset at the following anonymous repository:https://anonymous.4open.science/r/SAGE-5714
arXiv.org Artificial Intelligence
Jun-16-2025
- Country:
- Africa > Rwanda
- Asia
- Singapore (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Europe
- Germany > Baden-Württemberg
- Karlsruhe Region > Karlsruhe (0.04)
- Greece (0.04)
- Hungary > Csongrád-Csanád County
- Szeged (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Germany > Baden-Württemberg
- North America > Canada
- British Columbia > Vancouver (0.04)
- Quebec > Montreal (0.04)
- Oceania > Australia
- South America > Brazil (0.04)
- Genre:
- Research Report > New Finding (0.66)
- Technology: