Almost Optimal Algorithms for Linear Stochastic Bandits with Heavy-Tailed Payoffs
Han Shao, Xiaotian Yu, Irwin King, Michael R. Lyu
–Neural Information Processing Systems
In linear stochastic bandits, it is commonly assumed that pa yoffs are with sub-Gaussian noises. In this paper, under a weaker assumption on noises, we study the problem of lin ear stochastic b andits with he avy-t ailed payoffs (LinBET), where the distributions have finite moments of order 1+ ϵ,f o rs o m e ϵ (0, 1] .W e rigorously analyze the regret lower bound of LinBET as Ω( T
Neural Information Processing Systems
Nov-20-2025, 14:42:20 GMT
- Technology: