Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear Bandits

Neural Information Processing Systems 

The agent pulls an arm and observes the corresponding return provided by the environment.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found