WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
Liu, Xiao, Lai, Hanyu, Yu, Hao, Xu, Yifan, Zeng, Aohan, Du, Zhengxiao, Zhang, Peng, Dong, Yuxiao, Tang, Jie
arXiv.org Artificial Intelligence
We present WebGLM, a web-enhanced question-answering system based on the General Language Model (GLM). Its goal is to augment a pre-trained large language model (LLM) with web search and retrieval capabilities while remaining efficient enough for real-world deployment. To achieve this, we develop WebGLM with strategies for an LLM-augmented retriever, a bootstrapped generator, and a human preference-aware scorer. Specifically, we identify and address the limitations of WebGPT (OpenAI), which gives WebGLM advantages in accuracy, efficiency, and cost-effectiveness. In addition, we propose systematic criteria for evaluating web-enhanced QA systems. We conduct multi-dimensional human evaluation and quantitative ablation studies, which suggest that the proposed WebGLM designs outperform existing systems. In human evaluation, WebGLM with the 10-billion-parameter GLM (10B) performs better than the similar-sized WebGPT (13B) and even comparably to WebGPT (175B). The code, demo, and data are at https://github.com/THUDM/WebGLM.
Jun-13-2023
- Country:
- South America > Brazil
- Federal District > Brasília (0.04)
- Oceania > Australia
- Australian Capital Territory > Canberra (0.04)
- North America
- United States
- South Dakota (0.04)
- Missouri (0.04)
- New York > New York County
- New York City (0.04)
- Florida
- Leon County > Tallahassee (0.04)
- Escambia County > Pensacola (0.04)
- California > Los Angeles County
- Long Beach (0.07)
- Nicaragua > Managua
- Managua (0.04)
- Europe > Spain
- Asia
- India (0.04)
- Middle East > Republic of Türkiye
- Ankara Province > Ankara (0.04)
- China > Beijing
- Beijing (0.05)
- Genre:
- Research Report (1.00)
- Technology: