RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment