Enhancing Long-Chain Reasoning Distillation through Error-Aware Self-Reflection

Open in new window