Detection-Fusion for Knowledge Graph Extraction from Videos