Active Preference Learning for Large Language Models