TopViewRS: Vision-Language Models as Top-View Spatial Reasoners