Goto

Collaborating Authors

 Reinforcement Learning



2bce32ed409f5ebcee2a7b417ad9beed-Paper.pdf

Neural Information Processing Systems

We propose RLlib Flow, a hybrid actor-dataflow programming model for distributed RL, and validate its practicality by porting the full suite of algorithms in RLlib, a widely adopted distributed RL library.






AutomaticDataAugmentationforGeneralizationin ReinforcementLearning

Neural Information Processing Systems

Generalization to new environments remains a major challenge in deep reinforcement learning (RL). Current methods fail to generalize to unseen environments even when trained on similar settings [19, 51, 71, 11, 21, 12, 60].



284afdc2309f9667d2d4fb9290235b0c-Paper-Conference.pdf

Neural Information Processing Systems

Theseoutcome-conditioned imitationlearningmethodsare appealing because of their simplicity, strong performance, and close ties with supervisedlearning.