Adapting Interleaved Encoders with PPO for Language-Guided Reinforcement Learning in BabyAI

Open in new window