Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment

Open in new window