Training Chain-of-Thought via Latent-Variable Inference
