Goto

Collaborating Authors

 Reinforcement Learning



Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning? Jialu Gao

Neural Information Processing Systems

Subsequently, LfV oid trains an ensembled goal discriminator on the generated image to provide reward signals for a reinforcement learning agent, guiding it to achieve the goal.




Multi-Step Generalized Policy Improvement by Leveraging Approximate Models Lucas N. Alegre 1, 2 Ana L. C. Bazzan 1 Ann Now รฉ 2 Bruno C. da Silva 3 1

Neural Information Processing Systems

We introduce a principled method for performing zero-shot transfer in reinforcement learning (RL) by exploiting approximate models of the environment. Zero-shot transfer in RL has been investigated by leveraging methods rooted in generalized policy improvement (GPI) and successor features (SFs).






Zero-Shot Transfer with Deictic Object-Oriented Representation in Reinforcement Learning

Neural Information Processing Systems

Object-oriented representations in reinforcement learning have shown promise in transfer learning, with previous research introducing a propositional objectoriented framework that has provably efficient learning bounds with respect to samplecomplexity.