ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment