Report on Representations for Multimodal Generation Workshop