Revising Multimodal VAEs with Diffusion Decoders
