Disentangling Extraction and Reasoning in Multi-hop Spatial Reasoning