Larger or Smaller Reward Margins to Select Preferences for Alignment?