DIM: Dynamic Integration of Multimodal Entity Linking with Large Language Model

Shezheng Song, Shasha Li, Jie Yu, Shan Zhao, Xiaopeng Li, Jun Ma, Xiaodong Liu, Zhuo Li, Xiaoguang Mao

arXiv.org Artificial Intelligence 

Our study delves into Multimodal Entity Linking (MEL), the task of aligning mentions in multimodal contexts with entities in a knowledge base. Existing methods still face challenges such as ambiguous entity representations and limited utilization of image information. We therefore propose dynamic entity extraction using ChatGPT, which extracts entity representations on the fly and enhances the datasets. We further propose a method, Dynamically Integrate Multimodal information with knowledge base (DIM), that employs the visual understanding capability of a Large Language Model (LLM). The LLM, here BLIP-2, extracts entity-relevant information from the image, which facilitates better extraction of entity features and their linking to the dynamic entity representations provided by ChatGPT. Experiments demonstrate that DIM outperforms the majority of existing methods on the three original datasets and achieves state-of-the-art (SOTA) results on the dynamically enhanced datasets (Wiki+, Rich+, Diverse+).
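To make the described pipeline concrete, below is a minimal sketch of a DIM-style linking flow: BLIP-2 (via Hugging Face Transformers) extracts entity-relevant information from the image, and the mention plus that description is matched against candidate entity descriptions, which here stand in for ChatGPT's dynamically extracted entity representations. The checkpoints, prompt wording, and the sentence-embedding matcher are illustrative assumptions, not the authors' implementation.

```python
# Sketch of a DIM-style two-stage linking flow (illustrative, not the paper's code).
# Assumptions: the "Salesforce/blip2-opt-2.7b" checkpoint and a sentence-transformers
# encoder stand in for the paper's components; `candidates` maps entity names to
# textual descriptions playing the role of ChatGPT's dynamic entity representations.
import torch
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration
from sentence_transformers import SentenceTransformer, util

device = "cuda" if torch.cuda.is_available() else "cpu"
processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
blip2 = Blip2ForConditionalGeneration.from_pretrained("Salesforce/blip2-opt-2.7b").to(device)
encoder = SentenceTransformer("all-MiniLM-L6-v2", device=device)

def link_mention(image_path: str, mention: str, candidates: dict) -> str:
    """Return the candidate entity name best matching the mention."""
    # Step 1: prompt BLIP-2 for entity-relevant information in the image
    # (the prompt wording is a hypothetical example).
    image = Image.open(image_path).convert("RGB")
    prompt = f"Question: What does the image show about '{mention}'? Answer:"
    inputs = processor(images=image, text=prompt, return_tensors="pt").to(device)
    out_ids = blip2.generate(**inputs, max_new_tokens=40)
    visual_info = processor.batch_decode(out_ids, skip_special_tokens=True)[0].strip()

    # Step 2: match the mention plus its visual description against candidate
    # entity descriptions by embedding similarity (a simple stand-in for the
    # paper's feature extraction and linking).
    query = encoder.encode(f"{mention}. {visual_info}", convert_to_tensor=True)
    names = list(candidates)
    cand_embs = encoder.encode([candidates[n] for n in names], convert_to_tensor=True)
    scores = util.cos_sim(query, cand_embs)[0]
    return names[int(scores.argmax())]

# Example usage with hypothetical candidates:
# best = link_mention("photo.jpg", "Jordan",
#                     {"Michael Jordan (athlete)": "American basketball player ...",
#                      "Michael I. Jordan (scientist)": "Machine learning researcher ..."})
```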
