Frequency-enhanced Data Augmentation for Vision-and-Language Navigation: Supplemental Material
Keji He
Neural Information Processing Systems
Table 1 presents the impact of different random seeds used to sample the interference images. The experiments in the main manuscript use seed-1, which gives average performance.
Figure 1: Navigation examples in normal and high-frequency perturbed scenes.
In the examples shown in Figure 4, both models obtain similar textual attention. In Figure 6, the given instruction requires the agent to turn left and enter the room corresponding to the second view.
Oct-8-2025, 03:06:27 GMT