Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization

Open in new window