Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training

Open in new window