EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL

Neural Information Processing Systems 

To tackle the reward sparsity issue, one idea is to densify the reward by decomposing the general goal into sub-goals and rewarding them individually.