Multi-teacher Knowledge Distillation for Knowledge Graph Completion
Wang, Kai, Liu, Yu, Ma, Qian, Sheng, Quan Z.
–arXiv.org Artificial Intelligence
Link prediction based on knowledge graph embedding (KGE) aims to predict new triples to complete knowledge graphs (KGs) automatically. However, recent KGE models tend to improve performance by excessively increasing vector dimensions, which would cause enormous training costs and save storage in practical applications. To address this problem, we first theoretically analyze the capacity of low-dimensional space for KG embeddings based on the principle of minimum entropy. Then, we propose a novel knowledge distillation framework for knowledge graph embedding, utilizing multiple low-dimensional KGE models as teachers. Under a novel iterative distillation strategy, the MulDE model produces soft labels according to training epochs and student performance adaptively. The experimental results show that MulDE can effectively improve the performance and training speed of low-dimensional KGE models. The distilled 32-dimensional models are very competitive compared to some of state-or-the-art (SotA) high-dimensional methods on several commonly-used datasets.
arXiv.org Artificial Intelligence
Oct-19-2020
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States > New York
- New York County > New York City (0.04)
- Canada > Alberta
- United States > New York
- Europe > Slovenia
- Central Slovenia > Municipality of Ljubljana > Ljubljana (0.05)
- Asia
- Taiwan > Taiwan Province
- Taipei (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- China > Liaoning Province
- Dalian (0.05)
- Taiwan > Taiwan Province
- Oceania > Australia
- Genre:
- Research Report > New Finding (0.66)
- Industry:
- Education (1.00)
- Technology: