Learning Robust Options by Conditional Value at Risk Optimization
Takuya Hiraoka, Takahisa Imagawa, Tatsuya Mori, Takashi Onishi, Yoshimasa Tsuruoka
–Neural Information Processing Systems
In the reinforcement learning context, anOption means a temporally extended sequence of actions [30],andisregarded asuseful formanypurposes, such asspeeding uplearning, transferring skills across domains, and solving long-term planning problems.
Neural Information Processing Systems
Feb-12-2026, 02:36:47 GMT
- Country:
- Asia
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
- Middle East > Jordan (0.04)
- Japan > Honshū
- North America > Canada
- Asia
- Technology: