From to : Multidimensional Supervision of Reasoning Process for LLM Optimization