Vision-and-Language Navigation via Causal Learning