Disentangling Fine-Tuning from Pre-Training in Visual Captioning with Hybrid Markov Logic

Open in new window