Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback