Breaking Down and Building Up: Mixture of Skill-Based Vision-and-Language Navigation Agents