Technique to Baseline QE Artefact Generation Aligned to Quality Metrics

Farchi, Eitan, Nayak, Kiran, Majumdar, Papia Ghosh, Route, Saritha

Nov-21-2025 – arXiv.org Artificial Intelligence

Large Language Models (LLMs) are transforming Quality Engineering (QE) by automating the generation of artefacts such as requirements, test cases, and Behavior-Driven Development (BDD) scenarios. However, ensuring the quality of these outputs remains a challenge. This paper presents a systematic technique to baseline and evaluate QE artefacts using quantifiable metrics. The approach combines LLM-driven generation, reverse generation, and iterative refinement guided by rubrics for clarity, completeness, consistency, and testability. Experimental results across 12 projects show that reverse-generated artefacts can outperform low-quality inputs and maintain high standards when inputs are strong. The framework enables scalable, reliable QE artefact validation, bridging automation with accountability.
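As an illustration only, the rubric-guided refinement step described in the abstract could be sketched roughly as below. This is not the authors' implementation: the function names, the 0-to-1 score scale, the threshold, and the averaging rule are all assumptions, and the toy scorer and refiner are stand-ins for what would presumably be LLM calls. Reverse generation is not modelled here.

```python
from typing import Callable, Dict, Tuple

# The four rubric dimensions named in the abstract.
CRITERIA = ("clarity", "completeness", "consistency", "testability")

Scores = Dict[str, float]


def mean_score(scores: Scores) -> float:
    """Average the rubric scores (assumed scale: 0.0 to 1.0)."""
    return sum(scores[c] for c in CRITERIA) / len(CRITERIA)


def refine_until_baseline(
    artefact: str,
    score: Callable[[str], Scores],
    refine: Callable[[str, str], str],
    threshold: float = 0.8,  # assumed baseline, not taken from the paper
    max_rounds: int = 5,     # assumed cap on refinement rounds
) -> Tuple[str, Scores, int]:
    """Refine an artefact until its mean rubric score meets the threshold."""
    scores = score(artefact)
    rounds = 0
    while mean_score(scores) < threshold and rounds < max_rounds:
        # Target the weakest rubric dimension first.
        weakest = min(CRITERIA, key=lambda c: scores[c])
        artefact = refine(artefact, weakest)
        scores = score(artefact)
        rounds += 1
    return artefact, scores, rounds


# Hypothetical stand-ins for an LLM-based scorer and refiner.
def toy_score(text: str) -> Scores:
    # A criterion counts as satisfied once its tag appears in the text.
    return {c: 1.0 if f"[{c}]" in text else 0.5 for c in CRITERIA}


def toy_refine(text: str, criterion: str) -> str:
    # Pretend to improve the artefact by tagging the addressed criterion.
    return f"{text} [{criterion}]"


if __name__ == "__main__":
    final, scores, rounds = refine_until_baseline(
        "The system shall log in users.", toy_score, toy_refine
    )
    print(final, scores, rounds)
```

Passing the scorer and refiner in as callables keeps the loop independent of any particular model or prompt, so the same baseline check could be applied to requirements, test cases, or BDD scenarios.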

artificial intelligence, large language model, natural language

Genre:
- Research Report > New Finding (0.47)

Industry:
- Energy (0.68)
- Law (0.68)
- Information Technology (0.48)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
