A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms

Neural Information Processing Systems 

The MAB paradigm provides a succinct abstraction of the quintessential exploration vs. exploitation trade-offs inherent in many sequential decision making problems.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found