Goto

Collaborating Authors

 Reinforcement Learning



Synthesize Policies for Transfer and Adaptation across Tasks and Environments

Neural Information Processing Systems

Wefurther propose newtraining methods todisentangle the embeddings, making them both distinctive signatures of the environments and tasks and effective building blocks for composing the policies.


Learning Robust Options by Conditional Value at Risk Optimization

Neural Information Processing Systems

In the reinforcement learning context, anOption means a temporally extended sequence of actions [30],andisregarded asuseful formanypurposes, such asspeeding uplearning, transferring skills across domains, and solving long-term planning problems.