Improved Regret Analysis for Variance-Adaptive Linear Bandits and Horizon-Free Linear Mixture MDPs

Neural Information Processing Systems 

In online learning problems, exploiting low variance plays an important role in obtaining tight performance guarantees yet is challenging because variances are often not known a priori. Recently, considerable progress has been made by Zhang et al.
