The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning
