Adapting multi-armed bandits policies to contextual bandits scenarios
