Video sentence grounding with temporally global textual knowledge

Open in new window