EfficientFirst-OrderContextualBandits: Prediction,Allocation,andTriangularDiscrimination
–Neural Information Processing Systems
On the technical side, we show that the logarithmic loss and an informationtheoretic quantity called thetriangular discriminationplay a fundamental role in obtaining first-order guarantees, and we combine this observation with new refinements tothe regression oracle reduction framework ofFoster and Rakhlin [29].
Neural Information Processing Systems
Feb-10-2026, 06:05:09 GMT