Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning