Generalized Policy Elimination: an efficient algorithm for Nonparametric Contextual Bandits
