Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning