FinStat2SQL: A Text2SQL Pipeline for Financial Statement Analysis
Nguyen, Quang Hung, Trinh, Phuong Anh, Mai, Phan Quoc Hung, Trinh, Tuan Phong
–arXiv.org Artificial Intelligence
Despite the advancements of large language models, text2sql still faces many challenges, particularly with complex and domain-specific queries. In finance, database designs and financial reporting layouts vary widely between financial entities and countries, making text2sql even more challenging. We present FinStat2SQL, a lightweight text2sql pipeline enabling natural language queries over financial statements. Tailored to local standards like VAS, it combines large and small language models in a multi-agent setup for entity extraction, SQL generation, and self-correction. We build a domain-specific database and evaluate models on a synthetic QA dataset. A fine-tuned 7B model achieves 61.33\% accuracy with sub-4-second response times on consumer hardware, outperforming GPT-4o-mini. FinStat2SQL offers a scalable, cost-efficient solution for financial analysis, making AI-powered querying accessible to Vietnamese enterprises.
arXiv.org Artificial Intelligence
Nov-18-2025
- Country:
- Asia
- Middle East > Jordan (0.04)
- Vietnam (0.05)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Germany > Baden-Württemberg
- Freiburg (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Belgium > Brussels-Capital Region
- North America > United States (0.04)
- Asia
- Genre:
- Overview (0.68)
- Research Report (0.82)
- Industry:
- Banking & Finance (1.00)
- Technology: