Grounding Acoustic Echoes in Single View Geometry Estimation