A Good Score Does Not Lead to a Good Generative Model