Polar Ducks and Where to Find Them: Enhancing Entity Linking with Duck Typing and Polar Box Embeddings
Atzeni, Mattia, Plekhanov, Mikhail, Dreyer, Frédéric A., Kassner, Nora, Merello, Simone, Martin, Louis, Cancedda, Nicola
–arXiv.org Artificial Intelligence
Entity linking methods based on dense retrieval are an efficient and widely used solution in large-scale applications, but they fall short of the performance of generative models, as they are sensitive to the structure of the embedding space. In order to address this issue, this paper introduces DUCK, an approach to infusing structural information in the space of entity representations, using prior knowledge of entity types. Inspired by duck typing in programming languages, we propose to define the type of an entity based on the relations that it has with other entities in a knowledge graph. Then, porting the concept of box embeddings to spherical polar coordinates, we propose to represent relations as boxes on the hypersphere. We optimize the model to cluster entities of similar type by placing them inside the boxes corresponding to their relations. Our experiments show that our method sets new state-of-the-art results on standard entity-disambiguation benchmarks, it improves the performance of the model by up to 7.9 F1 points, outperforms other type-aware approaches, and matches the results of generative models with 18 times more parameters.
arXiv.org Artificial Intelligence
Oct-20-2023
- Country:
- Oceania > Australia (0.04)
- South America
- Brazil > Rio de Janeiro
- Rio de Janeiro (0.04)
- Argentina > Pampas
- Buenos Aires F.D. > Buenos Aires (0.04)
- Brazil > Rio de Janeiro
- North America
- Panama (0.04)
- Mexico (0.04)
- United States
- Oregon (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.05)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Colorado > Denver County
- Denver (0.04)
- California
- San Francisco County > San Francisco (0.04)
- Napa County (0.04)
- Los Angeles County > Los Angeles (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada > Ontario
- Toronto (0.04)
- Europe
- France (0.28)
- Lithuania (0.04)
- Austria (0.04)
- Poland (0.04)
- Romania (0.04)
- Germany > Berlin (0.04)
- Finland (0.04)
- Monaco (0.04)
- Norway (0.04)
- Russia (0.04)
- Greece (0.04)
- Czechia > Prague (0.04)
- Hungary (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Bulgaria
- Yambol Province > Yambol (0.04)
- Vratsa Province > Vratsa (0.04)
- Turgovishte Province > Targovishte (0.04)
- Sofia City Province > Sofia (0.04)
- Ruse Province > Ruse (0.04)
- Pazardzhik Province > Pazardzhik (0.04)
- Kyustendil Province > Kyustendil (0.04)
- Spain
- Galicia > Madrid (0.04)
- Valencian Community > Valencia Province
- Valencia (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Serbia
- Central Serbia > Belgrade (0.04)
- Šumadija and Western Serbia > Šumadija District
- Kragujevac (0.04)
- Vojvodina > South Banat District
- Pančevo (0.04)
- Italy
- Tuscany > Florence (0.04)
- Piedmont > Turin Province
- Turin (0.04)
- Emilia-Romagna > Metropolitan City of Bologna
- Bologna (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Russia (0.04)
- China > Hong Kong (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Ankara Province > Ankara (0.04)
- Africa > Middle East
- Egypt > Cairo Governorate > Cairo (0.04)
- Genre:
- Research Report (0.82)
- Industry:
- Media (1.00)
- Leisure & Entertainment > Sports
- Soccer (1.00)
- Government > Regional Government
- Technology: