DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training