Endowing Embodied Agents with Spatial Reasoning Capabilities for Vision-and-Language Navigation
