Dueling Bandits with Team Comparisons Lee Cohen

Neural Information Processing Systems 

Multi-arm bandits (MAB) is a classical model of decision making under uncertainty. In spite of the simplicity of the model, it already incorporates the essential tradeoff between exploration and exploitation.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found