SPHERE: A Hierarchical Evaluation on Spatial Perception and Reasoning for Vision-Language Models