A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents

Open in new window