FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions