ChatQA: Building GPT-4 Level Conversational QA Models
Liu, Zihan, Ping, Wei, Roy, Rajarshi, Xu, Peng, Lee, Chankyu, Shoeybi, Mohammad, Catanzaro, Bryan
–arXiv.org Artificial Intelligence
In this work, we introduce ChatQA, a family of conversational question answering (QA) models that obtain GPT-4 level accuracies. Specifically, we propose a two-stage instruction tuning method that can significantly improve the zero-shot conversational QA results from large language models (LLMs). To handle retrieval-augmented generation in conversational QA, we fine-tune a dense retriever on a multi-turn QA dataset, which provides comparable results to using the state-of-the-art query rewriting model while largely reducing deployment cost. Notably, our ChatQA-70B can outperform GPT-4 in terms of average score on 10 conversational QA datasets (54.14 vs. 53.90), without relying on any synthetic data from OpenAI GPT models.
arXiv.org Artificial Intelligence
Jan-23-2024
- Country:
- Asia > Russia (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- Ukraine > Kyiv Oblast
- Kyiv (0.04)
- Romania (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Greece (0.04)
- Slovakia > Presov
- Prešov (0.04)
- Russia (0.04)
- France
- Grand Est > Bas-Rhin
- Strasbourg (0.04)
- Île-de-France > Paris
- Paris (0.04)
- Grand Est > Bas-Rhin
- United Kingdom > England (0.04)
- Spain (0.04)
- Germany
- Bavaria > Upper Bavaria
- Munich (0.04)
- Berlin (0.04)
- Saxony-Anhalt > Magdeburg (0.04)
- Bavaria > Upper Bavaria
- Bulgaria (0.04)
- Poland > Greater Poland Province
- Poznań (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Belgium > Brussels-Capital Region
- North America
- Canada > Quebec
- Montreal (0.04)
- United States > Indiana
- Marion County > Indianapolis (0.04)
- Canada > Quebec
- South America > Chile (0.04)
- Genre:
- Research Report (0.40)
- Industry:
- Education (1.00)
- Health & Medicine > Therapeutic Area
- Cardiology/Vascular Diseases (0.93)
- Endocrinology (1.00)
- Leisure & Entertainment > Sports (0.68)
- Technology: