Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor

Neural Information Processing Systems 

We introduce the Blackwell discount factor for Markov Decision Processes (MDPs). Classical objectives for MDPs include discounted, average, and Blackwell opti-mality.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found