Adaptive Querying for Reward Learning from Human Feedback