Gumbel Counterfactual Generation From Language Models