Benchmarking Diverse-Modal Entity Linking with Generative Models
Wang, Sijia, Li, Alexander Hanbo, Zhu, Henry, Zhang, Sheng, Hang, Chung-Wei, Perera, Pramuditha, Ma, Jie, Wang, William, Wang, Zhiguo, Castelli, Vittorio, Xiang, Bing, Ng, Patrick
–arXiv.org Artificial Intelligence
Entities can be expressed in diverse formats, such as texts, images, or column names and cell values in tables. While existing entity linking (EL) models work well on per modality configuration, such as text-only EL, visual grounding, or schema linking, it is more challenging to design a unified model for diverse modality configurations. To bring various modality configurations together, we constructed a benchmark for diverse-modal EL (DMEL) from existing EL datasets, covering all three modalities including text, image, and table. To approach the DMEL task, we proposed a generative diverse-modal model (GDMM) following a multimodal-encoder-decoder paradigm. Pre-training \Model with rich corpora builds a solid foundation for DMEL without storing the entire KB for inference. Fine-tuning GDMM builds a stronger DMEL baseline, outperforming state-of-the-art task-specific EL models by 8.51 F1 score on average. Additionally, extensive error analyses are conducted to highlight the challenges of DMEL, facilitating future research on this task.
arXiv.org Artificial Intelligence
May-26-2023
- Country:
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- United States
- Missouri (0.04)
- Arkansas (0.04)
- Virginia (0.04)
- Colorado (0.04)
- Pennsylvania (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Florida > Brevard County
- Cape Canaveral (0.05)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.14)
- Canada > British Columbia
- Europe
- Austria (0.04)
- Italy > Tuscany
- Florence (0.04)
- Spain
- Galicia > Madrid (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- France
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Asia
- Japan (0.04)
- Middle East
- Jordan (0.04)
- Israel > Tel Aviv District
- Tel Aviv (0.04)
- China
- Genre:
- Research Report (1.00)
- Industry:
- Leisure & Entertainment > Sports > Soccer (1.00)
- Technology: