MoGenTS: Motion Generation based on Spatial-Temporal Joint Modeling