Adaptive Margin RLHF via Preference over Preferences