Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints

Open in new window