TightRegretBoundsforModel-Based Reinforcement LearningwithGreedyPolicies