Prompting as Probing: Using Language Models for Knowledge Base Construction
Alivanistos, Dimitrios, Santamaría, Selene Báez, Cochez, Michael, Kalo, Jan-Christoph, van Krieken, Emile, Thanapalasingam, Thiviyan
arXiv.org Artificial Intelligence
Language Models (LMs) have proven to be useful in various downstream applications, such as summarisation, translation, question answering and text classification. LMs are becoming increasingly important tools in Artificial Intelligence, because of the vast quantity of information they can store. In this work, we present ProP (Prompting as Probing), which utilizes GPT-3, a large Language Model originally proposed by OpenAI in 2020, to perform the task of Knowledge Base Construction (KBC). ProP implements a multi-step approach that combines a variety of prompting techniques to achieve this. Our results show that manual prompt curation is essential, that the LM must be encouraged to give answer sets of variable lengths, in particular including empty answer sets, that true/false questions are a useful device to increase precision on suggestions generated by the LM, that the size of the LM is a crucial factor, and that a dictionary of entity aliases improves the LM score. Our evaluation study indicates that these proposed techniques can substantially enhance the quality of the final predictions: ProP won track 2 of the LM-KBC competition, outperforming the baseline by 36.4 percentage points.
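The abstract names several techniques: prompts that allow empty answer sets, a true/false check on each candidate, and an alias dictionary. A minimal sketch of how they might fit together is given below. It is not the authors' implementation. The function names, prompt wording, few-shot examples, relation label, and the `fake_lm` stand-in are all assumptions made for illustration, and a real system would call an actual language model in place of the stub.

```python
import ast
from typing import Callable, Dict, List

# Hypothetical few-shot examples. The second one has an empty answer set,
# so the model sees that "no objects" is an acceptable answer.
FEW_SHOT = [
    ("Germany", "CountryBordersWithCountry", ["France", "Poland", "Austria"]),
    ("Iceland", "CountryBordersWithCountry", []),
]


def build_prompt(subject: str, relation: str) -> str:
    """Format the few-shot examples followed by the open query."""
    lines = [f"{s} {r}: {objs!r}" for s, r, objs in FEW_SHOT]
    lines.append(f"{subject} {relation}:")
    return "\n".join(lines)


def parse_answers(completion: str) -> List[str]:
    """Parse a Python-style list from the completion; fall back to empty."""
    try:
        parsed = ast.literal_eval(completion.strip())
    except (ValueError, SyntaxError):
        return []
    if not isinstance(parsed, list):
        return []
    return [str(item) for item in parsed]


def fact_check(lm: Callable[[str], str], subject: str, relation: str, obj: str) -> bool:
    """Ask a true/false question about one candidate to filter false positives."""
    reply = lm(f"{subject} {relation} {obj}. True or False?")
    return reply.strip().lower().startswith("true")


def probe(lm: Callable[[str], str], subject: str, relation: str,
          aliases: Dict[str, str]) -> List[str]:
    """Generate candidates, keep those that pass the check, canonicalise names."""
    candidates = parse_answers(lm(build_prompt(subject, relation)))
    kept = [c for c in candidates if fact_check(lm, subject, relation, c)]
    canonical = [aliases.get(c, c) for c in kept]
    return list(dict.fromkeys(canonical))  # de-duplicate, preserving order


def fake_lm(prompt: str) -> str:
    """Stand-in for a real LM call (assumption): canned replies only."""
    if prompt.endswith("True or False?"):
        return "False" if "Spain" in prompt else "True"
    return "['Belgium', 'Deutschland', 'Spain']"


result = probe(fake_lm, "Netherlands", "CountryBordersWithCountry",
               {"Deutschland": "Germany"})
print(result)
```

In this toy run, the true/false step drops one candidate and the alias dictionary maps another to its canonical name. Passing the model in as a plain callable keeps the sketch independent of any particular API client.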
Jun-19-2023
- Country:
- Africa
- Asia
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- North Korea (0.04)
- Middle East
- Bahrain (0.04)
- Iran (0.04)
- Oman > Muscat Governorate
- Muscat (0.04)
- Qatar (0.04)
- Saudi Arabia (0.04)
- UAE (0.04)
- Russia (0.04)
- China (0.04)
- South Korea (0.04)
- Sri Lanka (0.04)
- Singapore (0.04)
- Afghanistan (0.04)
- Europe
- North Macedonia (0.04)
- Hungary (0.04)
- Moldova (0.04)
- Kosovo (0.04)
- Gibraltar (0.04)
- Sweden (0.04)
- Croatia
- Krapina-Zagorje County (0.04)
- Zagreb County > Zagreb (0.04)
- Ukraine (0.04)
- Belgium (0.04)
- Romania (0.04)
- Switzerland > Schwyz
- Schwyz (0.04)
- San Marino
- Borgo Maggiore > Borgo Maggiore (0.04)
- Fiorentino > Fiorentino (0.04)
- Russia (0.04)
- Italy
- Emilia-Romagna (0.04)
- Liguria (0.04)
- Tuscany (0.04)
- France (0.04)
- Serbia (0.04)
- Slovenia (0.04)
- United Kingdom > England (0.04)
- Albania (0.04)
- Spain
- Andalusia (0.04)
- Castilla-La Mancha (0.14)
- Ceuta (0.04)
- Extremadura (0.04)
- Melilla (0.04)
- Region of Murcia > Murcia (0.04)
- Germany
- Lower Saxony (0.04)
- Schleswig-Holstein (0.04)
- Poland (0.04)
- Bulgaria (0.04)
- Austria (0.04)
- Bosnia and Herzegovina (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Montenegro (0.04)
- North America
- Turks and Caicos Islands (0.04)
- Cuba (0.04)
- Haiti (0.04)
- The Bahamas (0.04)
- Canada (0.04)
- United States
- Jamaica (0.04)
- Mexico (0.04)
- Barbados (0.04)
- Dominica (0.04)
- Trinidad and Tobago (0.04)
- Oceania
- South America
- Genre:
- Research Report > New Finding (0.86)
- Industry:
- Automobiles & Trucks > Manufacturer (0.93)
- Government (0.68)
- Health & Medicine (0.68)
- Leisure & Entertainment > Sports (0.68)
- Media (1.00)
- Technology: