BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra
Glenn, Parker, Dakle, Parag Pravin, Wang, Liang, Raghavan, Preethi
–arXiv.org Artificial Intelligence
Many existing end-to-end systems for hybrid question answering tasks can often be boiled down to a "prompt-and-pray" paradigm, where the user has limited control and insight into the intermediate reasoning steps used to achieve the final result. Additionally, due to the context size limitation of many transformer-based LLMs, it is often not reasonable to expect that the full structured and unstructured context will fit into a given prompt in a zero-shot setting, let alone a few-shot setting. We introduce BlendSQL, a superset of SQLite to act as a unified dialect for orchestrating reasoning across both unstructured and structured data. For hybrid question answering tasks involving multi-hop reasoning, we encode the full decomposed reasoning roadmap into a single interpretable BlendSQL query. Notably, we show that BlendSQL can scale to massive datasets and improve the performance of end-to-end systems while using 35% fewer tokens. Our code is available and installable as a package at https://github.com/parkervg/blendsql.
arXiv.org Artificial Intelligence
Jun-10-2024
- Country:
- Asia
- China > Beijing
- Beijing (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Singapore (0.04)
- China > Beijing
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Spain (0.04)
- Belgium > Brussels-Capital Region
- North America
- Belize > Belize District
- Belize City (0.04)
- Canada > Ontario
- Toronto (0.04)
- Dominican Republic (0.04)
- United States
- California (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Washington > King County
- Seattle (0.04)
- Belize > Belize District
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Leisure & Entertainment > Sports (0.93)
- Technology: