STRuCT-LLM: Unifying Tabular and Graph Reasoning with Reinforcement Learning for Semantic Parsing

Stoisser, Josefa Lia, Martell, Marc Boubnovski, Phillips, Lawrence, Hansen, Casper, Fauqueur, Julien

Jun-30-2025–arXiv.org Artificial Intelligence

We propose STRuCT-LLM, a unified framework for training large language models (LLMs) to perform structured reasoning over both relational and graph-structured data. Our approach jointly optimizes Text-to-SQL and Text-to-Cypher tasks using reinforcement learning (RL) combined with Chain-of-Thought (CoT) supervision. To support fine-grained optimization in graph-based parsing, we introduce a topology-aware reward function based on graph edit distance. Unlike prior work that treats relational and graph formalisms in isolation, STRuCT-LLM leverages shared abstractions between SQL and Cypher to induce cross-formalism transfer, enabling SQL training to improve Cypher performance and vice versa--even without shared schemas. Our largest model (QwQ-32B) achieves substantial relative improvements across tasks: on semantic parsing, Spider improves by 13.5% and Text2Cypher by 73.1%. The model also demonstrates strong zero-shot generalization, improving performance on downstream tabular QA (TableBench: 8.5%) and knowledge graph QA (CR-LT-KGQA: 1.7%) without any QA-specific supervision. These results demonstrate both the effectiveness of executable queries as scaffolds for structured reasoning and the synergistic benefits of jointly training on SQL and Cypher (code available at https://github.com/bouv/ Listing order is random. 1 1 Introduction Large language models (LLMs) demonstrate impressive fluency in open-domain generation but often falter on structured reasoning tasks involving tables and graphs [12, 6]. Structured reasoning requires models to ground entities, compose symbolic constraints, and follow logical paths--skills crucial for interacting with real-world data systems such as relational databases and knowledge graphs (KGs) [16, 24]. We view executable semantic parsing--specifically, Text-to-SQL and Text-to-Cypher--as a gateway to this broader capability [32, 23]. While Text-to-SQL is well-studied, Text-to-Cypher remains underexplored, offering a valuable testbed for graph reasoning.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Jun-30-2025

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East > UAE (0.46)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Grammars & Parsing (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found