Learning Multiple Markov Chains via Adaptive Allocation

Mohammad Sadegh Talebi, Odalric-Ambrym Maillard

Neural Information Processing Systems 

Using ideas from the Multi-Armed Bandit (MAB) literature, previous works (e.g., [
