Few-Shot Visual Grounding for Natural Human-Robot Interaction

Open in new window