Towards Embodied Cognition in Robots via Spatially Grounded Synthetic Worlds