Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring

Neural Information Processing Systems 

Partial monitoring (PM) is a general sequential decision-making problem with limited feedback ( Rus-tichini, 1999; Piccolboni and Schindelhauer, 2001).

Similar Docs  Excel Report  more

TitleSimilaritySource
None found