Text-to-Pipeline: Bridging Natural Language and Data Preparation Pipelines
Ge, Yuhang, Liu, Yachuan, Ye, Zhangyan, Mao, Yuren, Gao, Yunjun
–arXiv.org Artificial Intelligence
Data preparation (DP) transforms raw data into a form suitable for downstream applications, typically by composing operations into executable pipelines. Building such pipelines is time-consuming and requires sophisticated programming skills, posing a significant barrier for non-experts. To lower this barrier, we introduce Text-to-Pipeline, a new task that translates NL data preparation instructions into DP pipelines, and PARROT, a large-scale benchmark to support systematic evaluation. To ensure realistic DP scenarios, PARROT is built by mining transformation patterns from production pipelines and instantiating them on 23,009 real-world tables, resulting in ~18,000 tasks spanning 16 core operators. Our empirical evaluation on PARROT reveals a critical failure mode in cutting-edge LLMs: they struggle not only with multi-step compositional logic but also with semantic parameter grounding. We thus establish a strong baseline with Pipeline-Agent, an execution-aware agent that iteratively reflects on intermediate states. While it achieves state-of-the-art performance, a significant gap remains, underscoring the deep, unsolved challenges for PARROT. It provides the essential, large-scale testbed for developing and evaluating the next generation of autonomous data preparation agentic systems.
arXiv.org Artificial Intelligence
Nov-11-2025
- Country:
- Europe
- Finland
- Southwest Finland > Turku (0.04)
- Uusimaa > Helsinki (0.04)
- Germany (0.04)
- Norway (0.04)
- United Kingdom (0.04)
- Finland
- North America
- Canada (0.05)
- United States (0.04)
- South America > Chile (0.04)
- Europe
- Genre:
- Research Report (0.63)
- Workflow (1.00)
- Technology: