Second Order Bounds for Contextual Bandits with Function Approximation

Open in new window