Learning Robust Options by Conditional Value at Risk Optimization

Takuya Hiraoka, Takahisa Imagawa, Tatsuya Mori, Takashi Onishi, Yoshimasa Tsuruoka

Neural Information Processing Systems 

Options are generally learned by using an inaccurate environment model (or simulator), which contains uncertain model parameters.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found