Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision

Feb-7-2026, 08:05:27 GMT–Neural Information Processing Systems

In the Vision-and-Language Navigation (VLN) task, an agent is asked to navigate inside 3D indoor environments by following given instructions. Cross-modal alignment is one of the most critical challenges in VLN because the predicted trajectory needs to match the given instruction accurately.

machine learning, natural language, trajectory, (18 more...)

Country:
- Asia > China > Beijing > Beijing (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Natural Language (0.68)
