Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling

Neural Information Processing Systems 

Learning the minimum/maximum mean among a finite set of distributions is a fundamental sub-task in planning, game tree search and reinforcement learning.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found