Video Captioning with Guidance of Multimodal Latent Topics

Open in new window