Diverse Preference Learning for Capabilities and Alignment