WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences

Liu, Xiao, Lai, Hanyu, Yu, Hao, Xu, Yifan, Zeng, Aohan, Du, Zhengxiao, Zhang, Peng, Dong, Yuxiao, Tang, Jie

Jun-13-2023–arXiv.org Artificial Intelligence

We present WebGLM, a web-enhanced question-answering system based on the General Language Model (GLM). Its goal is to augment a pre-trained large language model (LLM) with web search and retrieval capabilities while being efficient for real-world deployments. To achieve this, we develop WebGLM with strategies for the LLM-augmented retriever, bootstrapped generator, and human preference-aware scorer. Specifically, we identify and address the limitations of WebGPT (OpenAI), through which WebGLM is enabled with accuracy, efficiency, and cost-effectiveness advantages. In addition, we propose systematic criteria for evaluating web-enhanced QA systems. We conduct multi-dimensional human evaluation and quantitative ablation studies, which suggest the outperformance of the proposed WebGLM designs over existing systems. WebGLM with the 10-billion-parameter GLM (10B) is shown to perform better than the similar-sized WebGPT (13B) and even comparably to WebGPT (175B) in human evaluation. The code, demo, and data are at \url{https://github.com/THUDM/WebGLM}.

large language model, machine learning, question answering, (19 more...)

arXiv.org Artificial Intelligence

Jun-13-2023

arXiv.org PDF

Add feedback

Country:
- South America > Brazil
  - Federal District > Brasília (0.04)
- Oceania > Australia
  - Australian Capital Territory > Canberra (0.04)
- North America
  - United States
    - South Dakota (0.04)
    - Missouri (0.04)
    - New York > New York County
      - New York City (0.04)
    - Florida
      - Leon County > Tallahassee (0.04)
      - Escambia County > Pensacola (0.04)
    - California > Los Angeles County
      - Long Beach (0.07)
  - Nicaragua > Managua
    - Managua (0.04)
- Europe > Spain
  - Galicia > Madrid (0.04)
- Asia
  - India (0.04)
  - Middle East > Republic of Türkiye
    - Ankara Province > Ankara (0.04)
  - China > Beijing
    - Beijing (0.05)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Therapeutic Area
  - Infections and Infectious Diseases (0.46)
- Government > Regional Government
  - North America Government > United States Government (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Question Answering (1.00)
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found