Regret Minimization in MDPs with Options without Prior Knowledge

Neural Information Processing Systems 

Recent works leveraged the mapping of Markov decision processes (MDPs) with options to semi-MDPs (SMDPs) and introduced SMDP-versions of exploration-exploitation algorithms (e.g.,

Similar Docs  Excel Report  more

TitleSimilaritySource
None found