FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions

Open in new window