Classifier-Guided Captioning Across Modalities