Improved RegretAnalysisforVariance-Adaptive LinearBanditsandHorizon-FreeLinearMixture MDPs

Open in new window