Dynamic Noise Preference Optimization for LLM Self-Improvement via Synthetic Data