ScenarioBench: Trace-Grounded Compliance Evaluation for Text-to-SQL and RAG
