Preference Optimization with Multi-Sample Comparisons