Contextual bandits with entropy-based human feedback

Open in new window