RelBERT: Embedding Relations with Language Models
Ushio, Asahi; Camacho-Collados, Jose; Schockaert, Steven
arXiv.org Artificial Intelligence
Many applications need access to background knowledge about how different concepts and entities are related. Although Knowledge Graphs (KG) and Large Language Models (LLM) can address this need to some extent, KGs are inevitably incomplete and their relational schema is often too coarse-grained, while LLMs are inefficient and difficult to control. As an alternative, we propose to extract relation embeddings from relatively small language models. In particular, we show that masked language models such as RoBERTa can be straightforwardly fine-tuned for this purpose, using only a small amount of training data. The resulting model, which we call RelBERT, captures relational similarity in a surprisingly fine-grained way, allowing us to set a new state-of-the-art in analogy benchmarks. Crucially, RelBERT is capable of modelling relations that go well beyond what the model has seen during training. For instance, we obtained strong results on relations between named entities with a model that was only trained on lexical relations between concepts, and we observed that RelBERT can recognise morphological analogies despite not being trained on such examples. Overall, we find that RelBERT significantly outperforms strategies based on prompting language models that are several orders of magnitude larger, including recent GPT-based models and open source models.
Oct-8-2023
- Country:
- South America > Argentina (0.04)
- Oceania
- Australia (0.04)
- New Zealand > North Island
- Auckland Region > Auckland (0.04)
- North America
- Dominican Republic (0.04)
- United States
- Texas > Travis County
- Austin (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Colorado > Denver County
- Denver (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Washington > King County
- Seattle (0.04)
- Maine > Kennebec County
- Waterville (0.04)
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- California
- Santa Clara County > Palo Alto (0.04)
- San Francisco County > San Francisco (0.04)
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- New York > New York County
- New York City (0.04)
- Canada
- Europe
- Germany > Berlin (0.04)
- Slovenia (0.04)
- Croatia (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Spain > Galicia
- Madrid (0.04)
- Italy > Tuscany
- Florence (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- France
- Île-de-France > Paris
- Paris (0.04)
- Auvergne-Rhône-Alpes > Lyon
- Lyon (0.04)
- Norway > Eastern Norway
- Oslo (0.04)
- Serbia > Central Serbia
- Belgrade (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Republic of Türkiye > Istanbul Province
- Istanbul (0.04)
- Qatar > Ad-Dawhah
- Doha (0.04)
- Japan
- Kyūshū & Okinawa > Kyūshū
- Miyazaki Prefecture > Miyazaki (0.04)
- Honshū > Kansai
- Osaka Prefecture > Osaka (0.04)
- China
- Genre:
- Research Report (1.00)
- Industry:
- Leisure & Entertainment (1.00)
- Education > Educational Setting
- Higher Education (0.45)
- Technology: