Regret Bounds for Risk-Sensitive Reinforcement Learning