SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection
Hu, Mengya, Xu, Rui, Lei, Deren, Li, Yaxi, Wang, Mingyu, Ching, Emily, Kamal, Eslam, Deng, Alex
arXiv.org Artificial Intelligence
Large language models (LLMs) are highly capable but face latency challenges in real-time applications, such as online hallucination detection. To overcome this issue, we propose a novel framework that leverages a small language model (SLM) classifier for initial detection, followed by an LLM acting as a constrained reasoner that generates detailed explanations for detected hallucinated content. This study optimizes real-time, interpretable hallucination detection by introducing effective prompting techniques that align LLM-generated explanations with SLM decisions. Empirical results demonstrate the framework's effectiveness, thereby enhancing the overall user experience.
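The two-stage pipeline described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the function names (`slm_score`, `llm_explain`, `detect`), the toy token-overlap heuristic standing in for the SLM classifier, and the 0.5 decision threshold are all hypothetical placeholders for the trained SLM and the constrained LLM reasoner.

```python
# Sketch of the SLM-then-LLM hallucination detection pipeline.
# All names and the heuristic are illustrative assumptions.

HALLUCINATION_THRESHOLD = 0.5  # assumed SLM decision threshold


def slm_score(source: str, response: str) -> float:
    """Stand-in for a fast SLM classifier: estimates the probability that
    `response` is hallucinated relative to `source`. Toy heuristic: the
    fraction of response tokens absent from the source text."""
    src_tokens = set(source.lower().split())
    resp_tokens = response.lower().split()
    if not resp_tokens:
        return 0.0
    unsupported = [t for t in resp_tokens if t not in src_tokens]
    return len(unsupported) / len(resp_tokens)


def llm_explain(source: str, response: str) -> str:
    """Stand-in for the constrained LLM reasoner: in the real system, an
    LLM is prompted to explain (not overturn) the SLM's positive decision.
    Here it simply names the unsupported tokens."""
    src_tokens = set(source.lower().split())
    unsupported = [t for t in response.lower().split() if t not in src_tokens]
    return "Unsupported content: " + ", ".join(unsupported)


def detect(source: str, response: str) -> dict:
    """Two-stage detection: run the cheap SLM check on every request, and
    invoke the slower LLM reasoner only when a hallucination is flagged,
    keeping average latency low."""
    score = slm_score(source, response)
    if score >= HALLUCINATION_THRESHOLD:
        return {"hallucinated": True,
                "explanation": llm_explain(source, response)}
    return {"hallucinated": False, "explanation": None}
```

Because the LLM is only called on the (typically small) fraction of responses the SLM flags, most requests incur only the SLM's latency, which is the central trade-off the framework targets.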
Aug-22-2024