IaC-Eval: A Code Generation Benchmark for Cloud Infrastructure-as-Code Programs

Mar-18-2025, 00:19:51 GMT–Neural Information Processing Systems

Infrastructure-as-Code (IaC), an important component of cloud computing, allows the definition of cloud infrastructure in high-level programs. However, developing IaC programs is challenging, complicated by factors that include the burgeoning complexity of the cloud ecosystem (e.g., diversity of cloud services and workloads), and the relative scarcity of IaC-specific code examples and public repositories. While large language models (LLMs) have shown promise in general code generation and could potentially aid in IaC development, no benchmarks currently exist for evaluating their ability to generate IaC code. IaC-Eval's dataset includes 458 human-curated scenarios covering a wide range of popular AWS services, at varying difficulty levels. Each scenario mainly comprises a natural language IaC problem description and an infrastructure intent specification.

cloud infrastructure-as-code program, code generation benchmark, iac-eval, (1 more...)

Neural Information Processing Systems

Mar-18-2025, 00:19:51 GMT

Conferences Web Page

Add feedback

Industry:
- Information Technology > Services (0.99)

Technology:
- Information Technology
  - Artificial Intelligence
    - Natural Language > Large Language Model (0.87)
    - Representation & Reasoning > Automatic Programming (0.70)
  - Cloud Computing (1.00)