Efficient Post-Training Refinement of Latent Reasoning in Large Language Models