Generalizing Multimodal Variational Methods to Sets