Commonsense for Zero-Shot Natural Language Video Localization

Open in new window