Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation