Unified Multimodal Model with Unlikelihood Training for Visual Dialog

Open in new window