Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning

Open in new window