Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning? Jialu Gao
–Neural Information Processing Systems
Subsequently, LfV oid trains an ensembled goal discriminator on the generated image to provide reward signals for a reinforcement learning agent, guiding it to achieve the goal.
Neural Information Processing Systems
Feb-15-2026, 01:12:51 GMT