Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations