GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language

Open in new window