From Queries to Insights: Agentic LLM Pipelines for Spatio-Temporal Text-to-SQL
Redd, Manu, Zhe, Tao, Wang, Dongjie
–arXiv.org Artificial Intelligence
Natural-language-to-SQL (NL-to-SQL) systems hold promise for democratizing access to structured data, allowing users to query databases without learning SQL. Yet existing systems struggle with realistic spatio-temporal queries, where success requires aligning vague user phrasing with schema-specific categories, handling temporal reasoning, and choosing appropriate outputs. We present an agentic pipeline that extends a naive text-to-SQL baseline (llama-3-sqlcoder-8b) with orchestration by a Mistral-based ReAct agent. The agent can plan, decompose, and adapt queries through schema inspection, SQL generation, execution, and visualization tools. We evaluate on 35 natural-language queries over the NYC and Tokyo check-in dataset, covering spatial, temporal, and multi-dataset reasoning. The agent achieves substantially higher accuracy than the naive baseline 91.4% vs. 28.6% and enhances usability through maps, plots, and structured natural-language summaries. Crucially, our design enables more natural human-database interaction, supporting users who lack SQL expertise, detailed schema knowledge, or prompting skill. We conclude that agentic orchestration, rather than stronger SQL generators alone, is a promising foundation for interactive geospatial assistants.
arXiv.org Artificial Intelligence
Oct-31-2025
- Country:
- Asia
- China > Heilongjiang Province
- Daqing (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.27)
- China > Heilongjiang Province
- North America > United States
- Kansas > Douglas County
- Lawrence (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.16)
- New York > New York County
- New York City (0.04)
- Kansas > Douglas County
- Asia
- Genre:
- Research Report > New Finding (0.68)
- Technology: