Self-Evolution Fine-Tuning for Policy Optimization