STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBack
Gupta, Naman, Kirtania, Shashank, Gupta, Priyanshu, Kariya, Krishna, Gulwani, Sumit, Iyer, Arun, Parthasarathy, Suresh, Radhakrishna, Arjun, Rajamani, Sriram K., Soares, Gustavo
–arXiv.org Artificial Intelligence
Large Language Models (LLMs) often generate incorrect or outdated information, especially in low-resource settings or when dealing with private data. To address this, Retrieval-Augmented Generation (RAG) uses external knowledge bases (KBs), but these can also suffer from inaccuracies. We introduce STACKFEED, a novel Structured Textual Actor-Critic Knowledge base editing with FEEDback approach that iteratively refines the KB based on expert feedback using a multi-actor, centralized critic reinforcement learning framework. Each document is assigned to an actor, modeled as a ReACT agent, which performs structured edits based on document-specific targeted instructions from a centralized critic. Experimental results show that STACKFEED significantly improves KB quality and RAG system performance, enhancing accuracy by up to 8% over baselines.
arXiv.org Artificial Intelligence
Oct-14-2024
- Country:
- Asia > Middle East (0.28)
- North America
- Canada (0.28)
- United States (0.28)
- Genre:
- Research Report (0.85)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Neural Networks > Deep Learning (0.66)
- Reinforcement Learning (0.68)
- Natural Language
- Chatbot (0.66)
- Large Language Model (1.00)
- Representation & Reasoning
- Agents (1.00)
- Expert Systems (0.85)
- Search (0.68)
- Machine Learning
- Information Technology > Artificial Intelligence