Efficient Post-Training Refinement of Latent Reasoning in Large Language Models

Open in new window