TRAC: Trustworthy Retrieval Augmented Chatbot
Li, Shuo, Park, Sangdon, Lee, Insup, Bastani, Osbert
arXiv.org Artificial Intelligence
Although conversational AIs have demonstrated impressive performance, they often generate incorrect information, known as hallucinations. Retrieval-augmented generation has emerged as a promising way to reduce these hallucinations, but it still cannot guarantee correctness. Focusing on question answering, we propose a framework that can provide statistical guarantees for a retrieval-augmented question answering system by combining conformal prediction and global testing. In addition, we use Bayesian optimization to choose the hyperparameters of the global test so as to maximize the performance of the system. Our empirical results on the Natural Questions dataset demonstrate that our method can provide the desired coverage guarantee while minimizing the average prediction set size.
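The coverage guarantee mentioned in the abstract rests on conformal prediction. As a rough illustration of that one ingredient only, the sketch below shows generic split conformal prediction in Python. It is not the authors' implementation and leaves out the global test and the Bayesian optimization step. The function names and the nonconformity scores (assumed here to be something like one minus the model's confidence in an answer) are hypothetical.

```python
import math


def conformal_threshold(cal_scores, alpha):
    """Split-conformal threshold for a target miscoverage level alpha.

    cal_scores: nonconformity scores of the TRUE answers on a held-out
    calibration set (hypothetical scoring; lower means more conforming).
    """
    n = len(cal_scores)
    # Rank of the order statistic, with the usual finite-sample (n + 1) correction.
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this alpha: keep every candidate.
        return math.inf
    return sorted(cal_scores)[k - 1]


def prediction_set(candidate_scores, threshold):
    """Keep every candidate answer whose score does not exceed the threshold."""
    return {ans for ans, score in candidate_scores.items() if score <= threshold}


# Illustrative, made-up numbers.
cal = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
tau = conformal_threshold(cal, alpha=0.5)
answers = prediction_set({"Paris": 0.2, "Lyon": 0.7}, tau)
```

Under the standard exchangeability assumption, a set built this way contains the true answer with probability at least 1 - alpha. Smaller sets at the same coverage level are better, which matches the abstract's goal of minimizing average prediction set size.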
Jul-6-2023
- Country:
  - North America
    - Jamaica (0.04)
    - United States
      - Pennsylvania (0.04)
      - New York > New York County
        - New York City (0.04)
      - Hawaii > Honolulu County
        - Honolulu (0.04)
      - California > Santa Clara County
        - Stanford (0.04)
  - Europe
    - Spain > Catalonia
      - Barcelona Province > Barcelona (0.04)
    - Croatia > Primorje-Gorski Kotar County
      - Rijeka (0.04)
  - Asia
    - Middle East > Jordan (0.04)
    - India (0.04)
- Genre:
  - Research Report (1.00)
- Technology: