Tight performance bounds on greedy policies based on imperfect value functions
