Oceania
CounterfactualVision-and-LanguageNavigation: UnravellingtheUnseen
Aprominent challenge is to train an agent capable of generalising to new environments attest time, rather than one that simply memorises trajectories and visual details observed during training. We propose a new learning strategy that learns both from observations and generatedcounterfactual environments.
1289f9195d2ef8cfdfe5f50930c4a7c4-Supplemental-Conference.pdf
Additionally, prompt-based FT with the PCP outperforms state-of-the-art semi-supervised approaches with greater simplicity, eliminating the need for an iterative process and extra data augmentation. Our further analysis explores the performance lower bound of the PCP and reveals that the advantages of PCP persist across different sizes of models and datasets.