Instance-Level Semantic Maps for Vision Language Navigation