Reward Design via Online Gradient Ascent