Hierarchical Sequence Iteration for Heterogeneous Question Answering
Yang, Ruiyi, Xue, Hao, Razzak, Imran, Hacid, Hakim, Salim, Flora D.
–arXiv.org Artificial Intelligence
Retrieval-augmented generation (RAG) remains brittle on multi-step questions and heterogeneous evidence sources, trading accuracy against latency and token/tool budgets. This paper introduces Hierarchical Sequence (HSEQ) Iteration for Heterogeneous Question Answering, a unified framework that (i) linearize documents, tables, and knowledge graphs into a reversible hierarchical sequence with lightweight structural tags, and (ii) perform structure-aware iteration to collect just-enough evidence before answer synthesis. A Head Agent provides guidance that leads retrieval, while an Iteration Agent selects and expands HSeq via structure-respecting actions (e.g., parent/child hops, table row/column neighbors, KG relations); Finally the head agent composes canonicalized evidence to genearte the final answer, with an optional refinement loop to resolve detected contradictions. Besides, HSEQ exhibits three key advantages: (1) a format-agnostic unification that enables a single policy to operate across text, tables, and KGs without per-dataset specialization; (2) guided, budget-aware iteration that reduces unnecessary hops, tool calls, and tokens while preserving accuracy; and (3) evidence canonicalization for reliable QA, improving answers consistency and auditability. Large language models (LLMs), such as ChatGPT (Achiam et al., 2023), LLaMA (Dubey et al., 2024), Falcon (Zuo et al., 2025), have been increasingly relying on retrieval-augmented generation (RAG) to ground answers in external evidence. With reliable supplementary knowledge offered factual errors are reduced, especially in domain-specific questions, leading to higher accuracy and fewer hallucinations (Zhu et al., 2021b; Gao et al., 2023; Zhao et al., 2024). However they may fall with branchy plans, repeated web/file calls, and verbose chain-of-thought prompts, yielding unpredictable token/tool costs and latency; termination is often heuristic, leading to premature answers or extra wasted loops with budgets decoupled from the evidence actually inspected (Singh et al., 2025). Although existing heterogeneous RAG systems (Y u, 2022; Christmann & Weikum, 2024) are available to deal with multiple formats of data, they may still face issues in either weak alignment across representations or lossy and non-reversible serialization that obscures provenance and blocks faithful reconstruction. Hierarchical Sequence Iteration (HSEQ) for Heterogeneous Question Answering introduces a reversible hierarchical sequence interface that linearizes documents, tables, and KGs into a sequence of typed segments with lightweight structure (e.g., parent/child locality, offsets or coordinates, minimal schema/time tags).
arXiv.org Artificial Intelligence
Oct-24-2025
- Country:
- Oceania > Australia
- New South Wales (0.28)
- Asia > Middle East
- UAE (0.28)
- Oceania > Australia
- Genre:
- Research Report > New Finding (0.92)
- Technology: