Generalizing Goal-Conditioned Reinforcement Learningwith Variational Causal Reasoning