Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies