Using Pairwise Occurrence Information to Improve Knowledge Graph Completion on Large-Scale Datasets

Balkir, Esma, Naslidnyk, Masha, Palfrey, Dave, Mittal, Arpit

Oct-25-2019–arXiv.org Machine Learning

Using Pairwise Occurrence Information to Improve Knowledge Graph Completion on Large-Scale Datasets Esma Balkır 1,2*, Masha Naslidnyk 2, Dave Palfrey 2 and Arpit Mittal 2 1 University of Edinburgh, Scotland, UK 2 Amazon Research, Cambridge, UK 1 esma.balkir@ed.ac.uk 2 { naslidny, dpalfrey, mitarpit }@amazon.co.uk Abstract Bilinear models such as DistMult and ComplEx are effective methods for knowledge graph (KG) completion. However, they require large batch sizes, which becomes a performance bottleneck when training on large scale datasets due to memory constraints. In this paper we use occurrences of entity-relation pairs in the dataset to construct a joint learning model and to increase the quality of sampled negatives during training. We show on three standard datasets that when these two techniques are combined, they give a significant improvement in performance, especially when the batch size and the number of generated negative examples are low relative to the size of the dataset. We then apply our techniques to a dataset containing 2 million entities and demonstrate that our model outperforms the baseline by 2.8% absolute on hits@1. 1 Introduction A Knowledge Graph (KG) is a collection of facts which are stored as triples, e.g. Even though knowledge graphs are essential for various NLP tasks, open domain knowledge graphs have missing facts.

arxiv preprint arxiv, dataset, proceedings, (15 more...)

arXiv.org Machine Learning

Oct-25-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.05)
- Europe
  - France (0.05)
  - Germany > Berlin (0.04)
  - United Kingdom
    - Scotland > City of Edinburgh
      - Edinburgh (0.24)
    - England > Cambridgeshire
      - Cambridge (0.24)
  - Portugal > Lisbon
    - Lisbon (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Semantic Networks (1.00)
  - Natural Language (1.00)
  - Machine Learning > Statistical Learning (0.96)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found