Answering Visual-Relational Queries in Web-Extracted Knowledge Graphs
Oñoro-Rubio, Daniel, Niepert, Mathias, García-Durán, Alberto, González, Roberto, López-Sastre, Roberto J.
–arXiv.org Artificial Intelligence
A visual-relational knowledge graph (KG) is a multi-relational graph whose entities are associated with images. We explore novel machine learning approaches for answering visual-relational queries in web-extracted knowledge graphs. To this end, we have created ImageGraph, a KG with 1,330 relation types, 14,870 entities, and 829,931 images crawled from the web. With visual-relational KGs such as ImageGraph one can introduce novel probabilistic query types in which images are treated as first-class citizens. Both the prediction of relations between unseen images as well as multi-relational image retrieval can be expressed with specific families of visual-relational queries. We introduce novel combinations of convolutional networks and knowledge graph embedding methods to answer such queries. We also explore a zero-shot learning scenario where an image of an entirely new entity is linked with multiple relations to entities of an existing KG. The resulting multi-relational grounding of unseen entity images into a knowledge graph serves as a semantic entity representation. We conduct experiments to demonstrate that the proposed methods can answer these visual-relational queries efficiently and accurately.
arXiv.org Artificial Intelligence
May-3-2019
- Country:
- North America > United States > Massachusetts (0.28)
- Genre:
- Research Report (0.64)
- Overview (0.46)
- Industry:
- Technology: