Retrieval-augmented reasoning with lean language models

Chan, Ryan Sze-Yin, Nanni, Federico, Lazauskas, Tomas, Wood, Rosie, Yong, Penelope, Tarassenko, Lionel, Girolami, Mark, Geddes, James, Duncan, Andrew

Aug-18-2025–arXiv.org Artificial Intelligence

This technical report details a novel approach to combining reasoning and retrieval augmented generation (RAG) within a single, lean language model architecture. While existing RAG systems typically rely on large-scale models and external APIs, our work addresses the increasing demand for performant and privacy-preserving solutions deployable in resource-constrained or secure environments. Building on recent developments in test-time scaling and small-scale reasoning models, we develop a retrieval augmented conversational agent capable of interpreting complex, domain-specific queries using a lightweight backbone model. Our system integrates a dense retriever with fine-tuned Qwen2.5-Instruct models, using synthetic query generation and reasoning traces derived from frontier models (e.g., DeepSeek-R1) over a curated corpus, in this case, the NHS A-to-Z condition pages. We explore the impact of summarisation-based document compression, synthetic data design, and reasoning-aware fine-tuning on model performance. Evaluation against both non-reasoning and general-purpose lean models demonstrates that our domain-specific fine-tuning approach yields substantial gains in answer accuracy and consistency, approaching frontier-level performance while remaining feasible for local deployment. All implementation details and code are publicly released to support reproducibility and adaptation across domains.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Aug-18-2025

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom (1.00)

Genre:
- Overview (1.00)
- Research Report (0.84)

Industry:
- Information Technology (1.00)
- Health & Medicine > Therapeutic Area
  - Neurology (0.67)
- Government > Regional Government
  - Europe Government > United Kingdom Government (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found