Reinforcement Learning for Personalized Dialogue Management