Goto

Collaborating Authors

 drawingbot




Turbo Learning for CaptionBot and DrawingBot

Huang, Qiuyuan, Zhang, Pengchuan, Wu, Dapeng, Zhang, Lei

Neural Information Processing Systems

We study in this paper the problems of both image captioning and text-to-image generation, and present a novel turbo learning approach to jointly training an image-to-text generator (a.k.a. CaptionBot) and a text-to-image generator (a.k.a. DrawingBot). The key idea behind the joint training is that image-to-text generation and text-to-image generation as dual problems can form a closed loop to provide informative feedback to each other. Based on such feedback, we introduce a new loss metric by comparing the original input with the output produced by the closed loop. In addition to the old loss metrics used in CaptionBot and DrawingBot, this extra loss metric makes the jointly trained CaptionBot and DrawingBot better than the separately trained CaptionBot and DrawingBot. Furthermore, the turbo-learning approach enables semi-supervised learning since the closed loop can provide peudo-labels for unlabeled samples. Experimental results on the COCO dataset demonstrate that the proposed turbo learning can significantly improve the performance of both CaptionBot and DrawingBot by a large margin.


Turbo Learning for CaptionBot and DrawingBot

Huang, Qiuyuan, Zhang, Pengchuan, Wu, Dapeng, Zhang, Lei

Neural Information Processing Systems

We study in this paper the problems of both image captioning and text-to-image generation, and present a novel turbo learning approach to jointly training an image-to-text generator (a.k.a. CaptionBot) and a text-to-image generator (a.k.a. DrawingBot). The key idea behind the joint training is that image-to-text generation and text-to-image generation as dual problems can form a closed loop to provide informative feedback to each other. Based on such feedback, we introduce a new loss metric by comparing the original input with the output produced by the closed loop. In addition to the old loss metrics used in CaptionBot and DrawingBot, this extra loss metric makes the jointly trained CaptionBot and DrawingBot better than the separately trained CaptionBot and DrawingBot. Furthermore, the turbo-learning approach enables semi-supervised learning since the closed loop can provide peudo-labels for unlabeled samples. Experimental results on the COCO dataset demonstrate that the proposed turbo learning can significantly improve the performance of both CaptionBot and DrawingBot by a large margin.


You've seen the hype. Now you're curious. Why not have a crack at AI using this online lab...

#artificialintelligence

Microsoft has thrown open the doors to its AI Lab, a suite of beginner projects to help developers learn machine learning. There are five different experiments that cover computer vision, natural language processing, and drones. "Each lab gives you access to the experimentation playground, source code on GitHub, a crisp developer-friendly video, and insights into the underlying business problem and solution," according to Microsoftie Tara Shankar Jana on Tuesday. The first one is is the DrawingBot. It teaches developers about generative adversarial networks (GANs), a popular type of neural network that learns to create similar content to the data it was trained on.