Generalised Entropy MDPs and Minimax Regret

Androulakis, Emmanouil G., Dimitrakakis, Christos

Dec-10-2014–arXiv.org Machine Learning

Bayesian methods suffer from the problem of how to specify prior beliefs. One interesting idea is to consider worst-case priors. This requires solving a stochastic zero-sum game. In this paper, we extend well-known results from bandit theory in order to discover minimax-Bayes policies and discuss when they are practical.

artificial intelligence, bayesian inference, machine learning, (18 more...)

arXiv.org Machine Learning

Dec-10-2014

arXiv.org PDF

Add feedback

Country:
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.14)

Genre:
- Research Report (0.50)

Technology:
- Information Technology
  - Game Theory (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Uncertainty
      - Bayesian Inference (0.48)
    - Machine Learning > Learning Graphical Models
      - Directed Networks > Bayesian Learning (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found