Compound Returns Reduce Variance in Reinforcement Learning