Decentralized Cooperative Stochastic Bandits

Neural Information Processing Systems 

In the most basic setting of the multi-armed bandit problem, an agent must pull one arm (or action) from a finite set, and she receives a reward that depends on the chosen action.
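
The basic setting described above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and does not come from the source: the Bernoulli reward model, the class name `BernoulliBandit`, the helper `play_uniform`, and the uniformly random arm choice, which stands in for a real learning strategy.

```python
import random


class BernoulliBandit:
    """Hypothetical bandit: arm k pays 1.0 with probability means[k], else 0.0."""

    def __init__(self, means, seed=0):
        self.means = list(means)
        self.rng = random.Random(seed)

    def pull(self, arm):
        # The reward distribution depends only on the chosen arm.
        return 1.0 if self.rng.random() < self.means[arm] else 0.0


def play_uniform(bandit, rounds, seed=1):
    """Pull a uniformly random arm each round (a placeholder policy).

    Returns the total reward collected and the number of pulls per arm.
    """
    rng = random.Random(seed)
    counts = [0] * len(bandit.means)
    total = 0.0
    for _ in range(rounds):
        arm = rng.randrange(len(bandit.means))
        counts[arm] += 1
        total += bandit.pull(arm)
    return total, counts
```

For example, `play_uniform(BernoulliBandit([0.2, 0.5, 0.8]), 100)` plays 100 rounds on three arms. A learning agent would replace the random choice with a rule that favors arms whose observed rewards are higher.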
