UNITER-Based Situated Coreference Resolution with Rich Multimodal Input
