Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space

Neural Information Processing Systems 

Bayes' rule is used to produce a parameter estimate, which then decides the policy

Similar Docs  Excel Report  more

TitleSimilaritySource
None found