How Can Large Language Models Understand Spatial-Temporal Data?